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GENES REGULATED IN OVARIAN CANCER AS PROGNOSTIC 
AND THERAPEUTIC TARGETS 

TECHNICAL FIELD 

[01] The present invention belongs to the fields of medicine and relates to the use of 
genomic analysis to evaluate and treat ovarian cancer. In particular, this invention relates to 
the measurement of patterns of gene expression to determine the presence of ovarian 
cancer in a patients tissues. 

BACKGROUND ART 

[02] Ovarian cancer is one of the most common types of cancer that affects women in 
the United States, with a lifetime risk of approximately 1/70. See Whittemore, Gynecol. 
Oncol., Vol. 55, No. 3, Part 2, pp. 315-819 (1994). It is a rapidly fatal disease usually 
detected late, with still no good method of prevention. The greatest risk factor for ovarian 
cancer is a family history of the disease, suggesting the strong influence of genetics. See 
Schildkraut and Thompson, Am. J. Epidemiol., Vol. 128, No. 3, pp. 456-466 (1988). Other 
factors such as demographic, lifestyle and reproductive factors have also been shown to 
contribute to the risk of ovarian cancer. 

[03] Several microarray expression analyses of ovarian biopsies and cell lines have been 
conducted to identify genes specifically over-expressed in ovarian cancers. See 
Schummer et al., Gene, Vol. 238, No. 2, pp. 375-385 (1999). Other studies have tried to 
correlate gene expression levels with specific tumor types. See Bayani et al., Cancer Res 
Vol. 62, No. 12, pp. 3466-3476 (2002); Welsh et al., Proc. Natl. Acad. Sci. USA, Vol. 98 
No. 3, pp. 1176-1181 (2001); and Onoet al., Cancer Res., Vol. 60, No. 18, pp. 5007-5011 
(2000). 

[04] These kinds of studies, aimed at Increasing our understanding of the molecular 
mechanism of tumor development and in some cases at better classifying tumors, has 
provided a list of genes with few overlaps between analyses. Some technical differences 
may explain in part the apparent lack of consistency or low reproducibility between studies: 
quality of samples, amplification of the messenger ribonucleic acid (mRNA) and different 
microarray platforms. However, it is likely that the heterogeneity of the tumors is a key factor 
that contributes to the differences observed between studies, in particular, those where few 
tumors are analyzed. Furthermore, comparison of gene expression levels on microarray 
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experiments have historically been done using ratios of signal intensity (fold change), with 
limited use of statistical methods and a lack of validation with additional samples. 
[05] However, genes apparently expressed at high levels, or with the biggest change in 
expression, may not always be the most relevant; it is conceivable that a small disruption of 
the very tight regulation of genes may have dramatic consequences, even when the level of 
expression is low. 

[06] Thus there is a need for the identification of genes whose expression rates are 
consistently and reliably altered in ovarian cancer. Such a list could provide new insight into 
ovanan tumor development and progression, and suggest potential new drug targets, and 
biomarkers for diagnosis, monitoring and treatment of the disease. 

DISCLOSURE OF INVENTION 

[07] m the present invention, the application of a combination of statistical tests and the 
recently described leave-one-out method [see van't Veer et al.. Nature, Vol. 415 No 6871 
PP. 530-536 (2002)]. allows the analyze of expression profiles of tumors and normal ovarian 
t.ssues and for these patterns to also be determined in gene expression products in various 
body fluids including, but not limited to. blood and serum. See van't Veer et al., supra. 
[08] The study of two independent sets of samples, a test set and a validation set 
confirms the involvement of several known genes with ovarian tumor development, but also 
identify novel genes. These findings provide new insight into ovarian tumor development 
and progression, and suggest potential new drug targets, and biomarkers for diagnosis and 
monitoring of the disease. 

[09] In one embodiment, this invention provides a method to determine if a patient is 
afflicted with ovarian cancer comprising: 

a) obtaining a sample from the said patient; 

b) determining the levels of gene expression of two or more of the genes listed in 
Table 9 in the sample from the patient; 

c) comparing the levels of gene expression of the two or more genes determined in 
(b) to the levels of the same genes listed in Table 1 ; 

d) determining the degree of similarity (DOS) between the levels of gene expression 
of the two or more genes determined in (c); and 

e) determining from the DOS between the level of gene expression of the two or 
more genes the probability that the sample shows evidence of the presence of 
ovarian cancer in the patient. 
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[10] In a preferred embodiment, this invention provides a method wherein the levels of 
gene expression are determined for a subset of the genes listed in Table 9 comprising genes 
Nos. 1-28 in Table 9. 

[1 1] In another embodiment, the invention employs a sample comprising cells obtained 
from the patient. These may be cells removed from a solid tumor in the said patient or, in a 
preferred embodiment, the sample comprises blood cells and serum drawn from the said 
patient. In a most preferred embodiment, the sample comprises a body fluid drawn from the 
patient. 

[12] In a preferred embodiment, this invention employs a method of determining the level 
of gene expression comprising measuring the levels of protein expression product in the 
sample from the patient. This may be done in a variety of ways including, but not limited to, 
detecting the presence and level of the protein expression products using a reagent which ' 
specifically binds with the proteins, wherein the reagent may be selected from the group 
consisting of an antibody, an antibody derivative and an antibody fragment. 
[13] In another embodiment, this invention provides a method wherein the levels of 
expression in the sample are assessed by measuring the levels in the sample of the 
transcribed polynucleotides of the two or more gene in Table 9. These transcribed 
polynucleotide may be mRNA or complementary DNA (cDNA). 

[14] In a preferred embodiment, this method would further include the step of amplifying 
the transcribed polynucleotide. 

[15] In another embodiment, this invention includes a method of treating a subject 
afflicted with ovarian cancer, the method comprising providing to cells of the subject an 
antisense oligonuceotide complimentary to one or more of the genes whose expression is 
up-regulated in ovarian cancer as shown in Table 8. 

[16] In addition, this invention provides a method of inhibiting ovarian cancer in a subject 
at risk for developing ovarian cancer, the method comprising inhibiting expression of one or 
more of the genes shown in Table 8 to be up-regulated in ovarian cancer. 
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[1 7] This invention also provides kits for use in determining treatment strategy for a 
patient with suspected ovarian cancer comprising: 

a) a number (for example, two or more) of antibodies able to recognize and bind to 
the polypeptide expression product of the two or more of the genes in Table 9; 

b) a container suitable for containing the said antibodies and a sample of body fluid 
from the said individual wherein the antibody can contact the polypeptide expressed 
by the two or more genes shown in Table 9 if they are present; 

c) means to detect the combination of the said antibodies with the polypeptides 
expressed by the two or more genes shown in Table 9; and 

d) instructions for use and interpretation of the kit results. 

[1 8] In another embodiment, this invention provides a kit for use in determining the 
presence or absence of ovarian cancer in a patient comprising: 

a) a number (for example, two or more) of polynucleotides able to recognize and bind 
to the mRNA expression product of the two or more genes shown in Table 9; 

b) a container suitable for containing the said polynucleotides and a sample of body 
fluid from the said individual wherein the said polynucleotide can contact the mRNA, if 
it is present; 

c) means to detect the levels of combination of the said polynucleotide with the 
mRNA from the two or more genes shown in Table 9; and 

d) instructions for use and interpretation of the kit results. 

BRIEF DESCRIPTION OF DRAWINGS 

[19] FIG. 1 (A). Re-Classification of Samples Using Increasing Number of Probe Sets. 
[20] FIG. 1 (B). Plot of Errors as a Function of Number of Probe Sets for Determination of 
Optimum Number of Classification Genes. Calculated values by increasing number of 
individual probe sets from the top 6 to the top 55 (A) or to all 900 (B). Arrow indicates the 
minimum number of probe sets (N=28) that minimizes misclassification. 
[21] FIG. 2. Determination of a Threshold CC Value for Classification of Ovarian Status. 
[22] FIG. 3. Correlation of Test and Validation Biopsy Profiles with Mean Normal Profile 
for Different Size Probe Sets. N or T represent Normal or Tumorstatus, respectively. V is 
the PCC value of the probe set profile of the corresponding biopsy sample with the mean 
Normal profile (Group 1). Samples are ordered from highest CC to lowest. 
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[23] FIG. 4. Correlation of Biopsy Profiles with Mean of All Normal Profiles for Different 
Size Probe Sets. N or T represent Normal or Tumor status, respectively. V is the PCC 
value of the probe set profile of the corresponding biopsy sample with the mean profile of all 
Normal samples. Samples are ordered from highest CC to lowest. 

MODES FOR CARRYING OUT THE INVENTION 

[24] The present invention provides methods to determine whether or not a sample from 
a patient including, but not limited to, biopsy tissue or blood, serum or some other body fluid 
from a patient, contains evidence of the presence of ovarian cancer in the patient. 
[25] This invention is based, in part, on the discovery of approximately 900 genes which 
are differentially expressed In tissue from ovarian cancer as compared to normal tissue. This 
methods of this invention comprise measuring the activities of the approximately 900 or fewer 
genes that are shown to be differently-expressed in ovarian cancer as compared to normal 
tissue. 

[26] In a preferred embodiment, only a small fraction of the 900 genes would be 
measured. These measurements, could, in various embodiments, be in the tissue itself from 
biopsies, etc., or in preferred embodiments could be performed as more indirect 
measurement of gene expression including, but not limited to, cRNA or polypeptide 
expression products in various tissues including blood or other body fluids. 
[27] The measurements, direct or indirect, of the rates of expression of two or more of 
these 900 genes from an individual whose tissues status was unknown could then be 
compared to the expression values for the same two or more genes measured in ovarian 
cancer tissue or normal tissue. 

[28] The "degree of similarity" (DOS) of the unknown two or more gene expression 
values to the cancer tissue versus normal tissue would then be determined. 
[29] This DOS could be determined by any procedure that produces a result whose 
value is a known function of the DOS between the two groups of numbers, i.e., the measured 
gene expression values of the two or more genes in tissue from an individual whose ovarian 
cancer status is unknown and to be determined and the measured gene expression values 
for the same two or more genes from individuals whose tissue is known to contain ovarian 
cancer and from individuals whose tissue is known not to contain ovarian "cancer. " " 
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[30] As used herein the term "DOS" shall mean the extent to which the pattern of gene 
expression values are alike or numerically similar, as measured by a comparison of the 
values of gene expression determined by direct or indirect methods. 
[31] In a preferred embodiment, the DOS would be determined by a mathematical 
calculation resulting in a correlation coefficient (CC). In a particularly preferred embodiment, 
the Pearson Correlation Coefficient (PCC) would be determined but any other mathematical 
procedure that produces a result whose value is a known function of the DOS between the 
two groups of numbers could be used. 

[32] The value of the DOS (PCC), so calculated, can then be directly related to the 
probability that the tissue sample is from a patient who does or does not have ovarian 
cancer. That Is to say, the higher the patients' DOS (CC or PCC) as compared to the gene 
expression values from a patient who does not have ovarian cancer or the higher the DOS 
(CC or PCC) as compared to the gene expression values from a patient who does have 
ovarian cancer then the greater the probability that the patient does not or does have ovarian 
cancer, respectively. 

[33] Thus, in a given case, the value of the DOS can be used to determine probabilities 
for the presence of ovarian cancer. Those of skill in the art will understand that the clinical 
circumstance for each patient will dictate the value of the DOS (PCC) to be used as a cutoff 
or to help make clinical decisions with regard to a specific patient. For example, in one 
embodiment, it is desirable to determine with optimal accuracy the number of a group of 
patients who have ovarian cancer. This means to minimize both false positives (No Ovarian 
Cancer misclassified as Ovarian Cancer) and at the same time to minimize false negatives 
(Ovarian Cancer misclassified as No Ovarian Cancer ). 

[34] In one preferred embodiment of the present invention, this would work as shown in 
FIG. 3, using the 28 predictor probe set (as described below) if the gene expression profile 
correlates with the mean normal (No Ovarian Cancer) profile with a CC £0.920 the tissue 
sample is 63 times more likely to contain ovarian cancer then if the CC >0.920 [odds ratio 
(OR) = 63 with 95% confidence interval (CI): 3.3-1 1 94.7]. 

[35] To use this threshold in one embodiment of this invention, a patient whose gene 
expression profile when compared with the mean No Ovarian Cancer expression profile 
achieves a PCC of >0.920 would be classified in the No Ovarian Cancer group rand would be 
presumed not to have ovarian cancer, while a patient whose expression profile was had a 
PCC of £0.920 would be classified in the Ovarian Cancer group and would be assumed to 
have ovarian cancer with a high probability. 
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[36] In a further preferred embodiment, the PCC can be set to produce optional 
sensitivity. That is, to make the smallest possible number of false negatives (Ovarian Cancer 
misciassified as No Ovarian Cancer). Such an optimal sensitivity setting would be indicated 
in situations where the occurrence of ovarian cancer must be ruled out with the greatest 
certainty obtainable. In this embodiment, the threshold is determined by setting the PCC to 
>0.955. In this case, in the example given, using the 28 predictor probes shown in Table 9 
(probe sets 1-28 shown in Table 9), 100% of patients with a CC of >0.955 as compared to 
the No Ovarian Cancer group did not have ovarian cancer and 100% of the patients whose 
CC were <0.870, as compared to the No Ovarian Cancer group, did have ovarian cancer. 
[37] As is shown in the example, one of skill in the art can choose a PCC that will either 
maximize sensitivity or maximize specificity or produce any desired ratio of false positives or 
false negatives. One of skill in the art can easily adjust their choice of PCC to the clinical 
situation to provide maximum benefit and safety to the patient. 

[38] Another aspect of the of the invention are methods to treat ovarian cancer. These 
methods consist of various efforts to suppress the excess gene expression of the genes that 
have been found to be up-regulated in ovarian cancer. These genes are shown in Table 8. 
Methods to decrease the excess expression of these gene would include, but not be limited 
to, use of antisense DNA, siRNA and methods to complex and deactivate the protein 
expression products of these over-expressed genes. 
Methods of Measurement 

[39] In some embodiments of this invention, the gene expression of a selected group of 
the 900 genes is determined by measuring mRNA levels from tissue samples as described 
below. 

[40] In some embodiments, the gene expression can be measured more indirectly by 
measuring polypeptide gene expression products in tissues including, but not limited to, 
tumor and blood tissue. 

[41] In some embodiments, gene expression is measured by identifying the presence or 
amount of one or more proteins encoded by one of the genes listed in Table 9. 
[42] The present invention also provides systems for detecting two or more markers of 
interest, e.g., two or more markers from Table 2. For example, where it is determined that a 
finite set of particular markers provides relevant information, a detection system is provided 
that detects the finite set of markers. For example, as opposed to detecting all genes 
expressed in a tissue with a generic microarray, a defined microarray or other detection 
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technology is employed to detect the plurality, e.g.. 28, 42, etc., of markers that define a 
biological condition, e.g., the presence or absence of ovarian cancer, etc. 
[43] The present invention is not limited by the method in which gene expression 
biomarkers are detected or measured. In some embodiments, mRNA, cDNA or protein is 
detected in tissue samples, e.g., biopsy samples. In other embodiments, mRNA, cDNA or 
protein is detected in bodily fluids, e.g., serum, plasma, urine or saliva. A preferred 
embodiment of the invention provides that the method of the invention is performed ex vivo. 
The present invention further provides kits for the detection of these relevant gene 
expression biomarkers. 

[44] In some preferred embodiments, protein or the polypeptide expression product is 
detected. Protein expression may be detected by any suitable method. In some 
embodiments, proteins are detected by binding of an antibody specific for the protein. For 
example, in some embodiments, antibody binding is detected using a suitable technique 
including, but not limited to, radioimmunoassay, enzyme-linked immunosorbant assay 
(ELISA), "sandwich" immunoassays, immunoradiometric assays, gel diffusion precipitation 
reactions, immunodiffusion assays, in situ immunoassays, e.g., using colloidal gold, enzyme 
or radioisotope labels, e.g., Western blots, precipitation reactions, agglutination assays, e.g., 
gel agglutination assays, hemagglutination assays, etc., complement fixation assays, 
immunofluorescence assays, protein A assays, immunoelectrophoresis assays and 
proteomic assays, such as the use of gel electrophoresis coupled to mass spectroscopy to 
identify multiple proteins in a sample. 

[45] In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting binding of a 
secondary antibody or reagent to the primary antibody. In a further embodiment, the 
secondary antibody is labeled. Many methods are known in the art for detecting binding in 
an immunoassay and are within the scope of the present invention. 
[46] In some embodiments, an automated detection assay is utilized. Methods for the 
automation of immunoassays include, but are not limited to, those described in U.S. Patent 
Nos. 5,885,530; 4,981,785; 6,159,750; and 5,358,691 , each of which is herein incorporated 
by reference. In some embodiments, the analysis and presentation of results is also 
automated. For example, in some embodiments, software that generates a diagnosis and/or - 
prognosis based on the presence or absence of a series of proteins corresponding to 
markers is utilized. 
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[47] In other embodiments, the immunoassay described in U.S. Patent Nos. 5,599,677 
and 5.672,480, each of which is herein incorporated by reference, is utilized. In other 
embodiments, proteins are detected by immunohistochemistry. In still other embodiments, 
markers are detected at the level of cDNA or RNA. 

[48] As used herein, the term "gene expression biomarkers" shall mean any biologic 
marker which can indicate the rate or degree of gene expression of a specific gene including, 
but not limited to. mRNA, cDNA or the polypeptide expression product of the specific gene. 
[49] In some embodiments of the present invention, gene expression biomarkers are 
detected using a PCR-based assay. In yet other embodiments, reverse-transcriptase PCR 
(RT-PCR) is used to detect the expression of RNA. In RT-PCR, RNA is enzymatically 
converted to cDNA using a reverse-transcriptase enzyme. The cDNA is then used as a 
template for a PCR reaction. PCR products can be detected by any suitable method 
including, but not limited to. gel electrophoresis and staining with a DNA-specific stain or 
hybridization to a labeled probe. 

[50] In some embodiments, the quantitative RT-PCR with standardized mixtures of 
competitive templates method described in U.S. Patent Nos. 5,639.606; 5,643. 765; and 
5,876,978, each of which is herein incorporated by reference, is utilized. 
[51] In preferred embodiments of the present invention, gene expression biomarkers are 
detected using a hybridization assay. In a hybridization assay, the presence or absence of a 
marker is determined based on the ability of the nucleic acid from the sample to hybridize to 
a complementary nucleic acid molecule, e.g., an oligonucleotide probe. A variety of 
hybridization assays are available. 

[52] In some embodiments, hybridization of a probe to the sequence of interest is 
detected directly by visualizing a bound probe, e.g.. a Northern or Southern assay. See. 
e.g.. Ausabel et al., eds., Current Protocols in Molecular Biology, John Wiley & Sons. NY 
(1991). In these assays. DNA (Southern) or RNA (Northern) is isolated. The DNA or RNA is 
then cleaved with a series of restriction enzymes that cleave infrequently in the genome and 
not near any of the markers being assayed. The DNA or RNA is then separated, e.g.. on an 
agarose gel, and transferred to a membrane. A labeled probe or probes, e.g., by 
incorporating a radionucleotide, is allowed to contact the membrane under low-, medium- or 
high-stringency conditions. Unbound probe is removed and the presence of binding is 
detected by visualizing the labeled probe. 

[53] In some embodiments, the DNA chip assay is a GeneChip (Affymetrix, Santa Clara, 
CA). See, e.g., U.S. Patent Nos. 6,045,996; 5,925,525; and 5,858,659, each of which is 
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herein incorporated by reference. The GeneChip technology uses miniaturized, high-density 
arrays of oligonucleotide probes affixed to a "chip". Probe arrays are manufactured by 
Affymetrix's light-directed chemical synthesis process, which combines solid-phase chemical 
synthesis with photolithographic fabrication techniques employed in the semiconductor 
industry. Using a series of photolithographic masks to define chip exposure sites, followed 
by specific chemical synthesis steps, the process constructs high-density arrays of 
oligonucleotides, with each probe in a predefined position in the array. Multiple probe arrays 
are synthesized simultaneously on a large glass wafer. The wafers are then diced, and 
individual probe arrays are packaged in injection-molded plastic cartridges, which protect 
them from the environment and serve as chambers for hybridization. 
[54] The nucleic acid to be analyzed is isolated, amplified by PCR and labeled with a 
fluorescent reporter group. The labeled DNA is then incubated with the array using a fluidics 
station. The array is then inserted into the scanner, where patterns of hybridization are 
detected. The hybridization data are collected as light emitted from the fluorescent reporter 
groups already incorporated into the target, which is bound to the probe array. Probes that 
perfectly match the target generally produce stronger signals than those that have 
mismatches. Since the sequence and position of each probe on the array are known, by 
complementary, the identity of the target nucleic acid applied to the probe array can be 
determined. 

[55] In other embodiments, a DNA microchip containing electronically captured probes 
(Nanogen, San Diego, CA) is utilized. See, e.g., U.S. Patent Nos. 6,017,696; 6,068,818; and 
6,051,380, each of which are herein incorporated by reference. Through the use of 
microelectronics, Nanogen's technology enables the active movement and concentration of 
charged molecules to and from designated test sites on its semiconductor microchip. DNA 
capture probes unique to a given gene expression biomarkers are electronically placed at, or 
"addressed" to, specific sites on the microchip. Since nucleic acid molecules have a strong 
negative charge, they can be electronically moved to an area of positive charge. 
[56] In still further embodiments, an array technology based upon the segregation of 
fluids on a flat surface (chip) by differences in surface tension (ProtoGene, Palo Alto, CA) is 
utilized. See, e.g., U.S. Patent Nos. 6,001,311; 5.985,551; and 5,474,796, each of which is 
herein incorporated by reference. Protogene's technology is based on the fact that fluids can 
be segregated on a flat surface by differences in surface tension that have been imparted by 
chemical coatings. Once so segregated, oligonucleotide probes are synthesized directly on 
the chip by ink-jet printing of reagents. 
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[57] In yet other embodiments, a "bead array" is used for the detection of gene 
expression biomarkers (lllumina, San Diego, CA). See, e.g., PCT Publications WO 99/67641 
and WO 00/39587, each of which is herein incorporated by reference, lllumina uses a BEAD 
ARRAY technology that combines fiber optic bundles and beads that self-assemble into an 
array. Each fiber optic bundle contains thousands to millions of individual fibers depending 
on the diameter of the bundle. The beads are coated with an oligonucleotide specific for the 
detection of a given marker. Batches of beads are combined to form a pool specific to the 
array. To perform an assay, the BEAD ARRAY is contacted with a prepared sample. 
Hybridization is detected using any suitable method. 

[58] In some preferred embodiments of the present invention, hybridization is detected 
by enzymatic cleavage of specific structures, e.g., INVADER™ assay, Third Wave 
Technologies. See, e.g., U.S. Patent Nos. 5,846,717, 6,090, 543; 6,001,567; 5,985,557; and 
5,994,069, each of which is herein incorporated by reference. In some embodiments, 
hybridization of a bound probe is detected using a TaqMan assay (PE Biosystems, Foster 
City, CA). See, e.g., U.S. Patent Nos. 5,962,233 and 5,538,848, each of which is herein 
incorporated by reference. The assay is performed during a PCR reaction. The TaqMan 
assay exploits the 5*-3" exonuclease activity of DNA polymerases, such as AMPLITAQ DNA 
polymerase. A probe, specific for a given marker, is included in the PCR reaction. The 
probe consists of an oligonucleotide with a 5'-reporter dye, e.g., a fluorescent dye and a 
3'-quencher dye. During PCR, if the probe is bound to its target, the 5*-3' nucleolytic activity 
of the AMPLITAQ polymerase cleaves the probe between the reporter and the quencher dye. 
The separation of the reporter dye from the quencher dye results in an increase of 
fluorescence. The signal accumulates with each cycle of PCR and can be monitored with a 
fluorimeter. 

[59] Additional detection assays that are produced and utilized using the systems and 
methods of the present invention include, but are not limited to, enzyme mismatch cleavage 
methods, e.g., Variagenics (see U.S. Patent Nos. 6,1 10,684; 5,958,692; and 5,851 ,770, 
herein incorporated by reference in their entireties); branched hybridization methods, e.g., 
Chiron (see U.S. Patent Nos. 5.849,481; 5,710,264; 5,124,246; and 5,624,802, herein 
incorporated by reference in their entireties); rolling circle replication (see, e.g., U.S. Patent 
Nos. 6,210,884 and 6,183,960, herein incorporated by reference in their entireties); NASBA 
(see, e.g., U.S. Patent No. 5,409,818, herein incorporated by reference in its entirety); 
molecular beacon technology (see, e.g.. U.S. Patent No. 6.150.097. herein incorporated by 
reference in its entirety); E-sensor technology (see Motorola, U.S. Patent Nos. 6,248,229; 
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6,221,583; 6,013,170; and 6,063,573, herein incorporated by reference in their entireties); 
cycling probe technology (see, e.g., U.S. Patent Nos. 5,403,71 1 ; 5,01 1 ,769; and 5,660,988, 
herein incorporated by reference in their entireties); ligase chain reaction [see Barnay, 
Proc. Natl. Acad. Scl. USA, Vol. 88, pp. 189-93 (1991)]; and sandwich hybridization methods 
(see, e.g., U.S. Patent No. 5,288,609, herein incorporated by reference in its entirety). 
[60] In some embodiments, mass spectroscopy is used to detect gene expression 
biomarkers. For example, in some embodiments, a MASSARRAY™ system (Sequenom, 
San Diego, CA) is used to detect gene expression biomarkers. See, e.g., U.S. Patent 
Nos. 6,043,031; 5,777,324; and 5,605,798, each of which is herein incorporated by 
reference. 

[61] In some embodiments, the present invention provides kits for the identification, 
characterization and quantitation of gene expression biomarkers. In some embodiments, the 
kits contain antibodies specific for gene expression biomarkers, in addition to detection 
reagents and buffers. In other embodiments, the kits contain reagents specific for the 
detection of nucleic acid, e.g., oligonucleotide probes or primers. In preferred embodiments, 
the kits contain all of the components necessary to perform a detection assay, including all 
controls, directions for performing assays and any necessary software for analysis and 
presentation of results. In some embodiments, the kits contain instructions including a 
statement of intended use as required by the Environmental Protection Agency or U.S. Food 
and Drug Administration (FDA) for the labeling of in vitro diagnostic assays and/or of 
pharmaceutical or food products. 

[62] Comparison of the organism's gene expression pattern, with the result expressed in 
Table 9, would indicate whether the organism has a gene expression profile which may 
indicate that the organism does or does not contain ovarian cancer. 
[63] In another embodiment, the present invention is a method of screening a test 
compound for the ability to inhibit, retard, reverse or mimic the gene expression changes 
characteristic of ovarian cancer. In a typical example of this embodiment, one would first 
treat a test mammal known to have ovarian cancer with a test compound and then analyze a 
representative tissue of the mammal for the level of expression of the genes or sequences 
which change in expression in response to ovarian cancer. Preferably, the tissue is biopsy 
material from the tumor or, in a preferred embodiment, an easily obtainable tissue, such as 
blood or serum. 

[64] One then compares the analysis of the tissue with a control mammal known to have 
ovarian cancer but not given the test compound and thereby identifies test compounds that 
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are capable of modifying the expression of the gene expression biomarkers sequences in the 
mammalian samples such that the expression is altered toward the No Ovarian Cancer 
pattern. 

[65] In another embodiment of the present invention, one would use the sequences of 
the genes disclosed in Table 2 for a therapy for mimicking the No Ovarian Cancer state. In 
general, one would try to amplify gene expression for the genes identified herein as under- 
expressed in ovarian cancer and decrease the expression of genes identified herein as over- 
expressed in ovarian cancer. For example, one might try to decrease the expression of 
genes or sequences identified in Table 2 as increased or increase the expression of genes 
found to be decreased in ovarian cancer. 

[66] Methods of increasing and decreasing expression would be known to one of skill in 
the art. Examples for supplementation of expression would include supplying the organism 
with additional copies of the gene. A preferred example for decreasing expression would 
include RNA antisense technologies or pharmaceutical intervention. The genes disclosed in 
Table 2 would be appropriate drug development targets. One would use the information 
presented in the present application for drug development by using cun-ently existing, or by 
developing, pharmaceutical compounds that either mimic or inhibit the activity of the genes 
listed in Table 2, or the proteins encoded by these genes. Therefore, the gene expression 
biomarkers or genes disclosed herein represent targets for pharmaceutical development and 
gene therapy or RNA antisense therapy with the goal of suppressing the changes 
characteristic of ovarian cancer at the molecular level. These gene expression alterations 
may also play a role in understanding the various mechanisms that underlie ovarian cancer. 
Additionally, these genes represent biomarkers of ovarian cancer that can be used for 
diagnostic purposes. 

[67] The present invention is not limited by the form of the expression profile. In some 
embodiments, the expression profile is maintained in computer software. In some 
embodiments, the expression profile is written material. The present invention is not limited 
by the number of markers provided or displayed in an expression profile. For example, the 
expression profile may comprise two or more markers found in Table 2, indicating a 
biological status of a sample. 

[68] The present invention further provides databases comprising expression 
information, e.g., expression profiles comprising one or more markers from Table 2 from one 
or more samples. In some embodiments, the databases find use in data analysis including, 
but not limited to, comparison of markers to one or more public or private information 
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databases, e.g., OMIM, GenBank, BLAST, Molecular Modeling Databases, Medline, genome 
databases, etc. In some such embodiments, an automated process is carried out to 
automatically associate information obtained from data obtained using the methods of the 
present invention to information in one or more of public or private databases. Associations 
find use, e.g., in making expression correlations to phenotypes, e.g., disease states. 
[69] We also understand the present invention to be extended to mammalian 
homologues of the mouse genes listed in Table 9. One of skill in the art could easily 
investigate homologues in other mammalian species by identifying particular genes with 
sufficiently high homology to the genes listed in Table 9. By "high homology" we mean that 
the homology is at least 50% overall (within the entire gene or protein) either at the 
nucleotide or amino acid level. 



List of Abbreviations 
A Absent 

AvgDiff Average Difference (overall intensity 

of probe set on Affymetrix array) 
CHTN Cooperative Human Tissue Network 
CI Confidence Interval 

FIGO Federation of Gynecology and 
Obstetrics 

GAPDH Glyceraldehyde 3-phosphate 

dehydrogenase 
GNF Genomics Institute of the Novartis 

Research Foundation 
mg Milligram 



NCBI 


National Center for Biotechnology 




Information 


Neg 


Negative 


nM 


Nanometer 


OMIM 


Online Mendelian Inheritance in Man 


OR 


Odds Ratio 


ORF 


Open Reading Frame 


P 


Present 


PG 


Pharmacogenetics 


Pos 


Positive 


QC 


Quality Control 


RNA 


Ribonucleic Acid 



EXAMPLE 1 

Preferred Methods 

[70] To identify genes involved in the development and progression of ovarian tumors, 
we compared the gene expression profiles of a series of Normal and Tumor ovarian biopsies. 
Gene expression data for more than 12,000 genes were generated from each sample. Of 
the 900 probe sets that we observed to be most differentially-expressed between the Normal 
and cancerous ovarian biopsies, 98% were down-regulated in the Tumor biopsies. Using 
8 Normal and 10 Tumor samples, we identified a minimum number of probe sets (28) that 
could be used to classify biopsies as Normal or Tumor. This finding was validated on a 
second set of biopsies (4 Normal and 14 Tumor) previously profiled by another laboratory. A 
mean Normal ovarian profile was established that could be used as a reference to compare 
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other ovarian biopsies. The identification of the most differentially-expressed genes between 
Normal and Tumor ovarian biopsies may provide new insight into the molecular mechanisms 
of ovarian tumor development and progression. Some of the genes identified in this study 
are known to be involved in ovarian cancer, but a large proportion represents novel 
candidates for drug targets and molecular biomarkers to diagnose or monitor disease and 
treatment. 

Materials and Method* 
Samples 

[71] Flash-frozen ovarian biopsies were obtained from Asterand (Detroit, Ml) and 
consisted of 10 Tumor samples and 10 adjacent Normal tissues. Total RNA was also 
purchased for 4 additional samples from Ambion (Austin, TX) and Stratagene (La Jolla CA) 
Gene expression profiles from samples used in the validation step had been previously 
generated at GNF and reported. See Welsh et al. (2001 ), supra. 

[72] Most of the tumors analyzed were malignant surface epithelial serous tumors e g 
papillary cystcarcinoma, papillary cystadenocarcinoma or papillary cystcarcinoma; others ' 
included a mucinous cyst carcinoma, an endometrioid adenocarcinoma and a mature 
teratoma. 

[73] A summary of sample information is shown in Table 1 below. 
Tab,e 1 ' Ovarian Samples Used for Gene Expression Analysis 



Sample 
code 



P2437 



P2709 



P5720 



Status 



Normal 



Normal 



Normal 



Comment/Tumor Type 



Normal margin to cystadenoma 



No pathology data 



Tumor 
Cell (%) 



Source 



Stratagene 



Ambion 



P5721 



Tumor 



P6166 
P6167 



Tumor 



Adenocarcinoma 
Serous cyst carcinoma 



Not 
Available 



Ambion 



Normal 



70 



Asterand 



p6168 



Tumor 



Serous cyst carcinoma 



Asterand 



p6169 
p6170 



Normal 



50 



Asterand 



Tumor 



P6171 



Serous cyst carcinoma 



p6172 



Normal 
Tumor 



80 



IC 



Asterand 



p6173 
p6174 



Endometrioid adenocarcinoma 



Asterand 



Normal 



50 



IV 



Asterand 



Tumor 
Normal 



Mucinous cyst carcinoma 



40 



p6175 



Asterand 
Asterand 
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Sample 
code 



Status 



Comment/Tumor Type 



Tumor 
Cell {%) 



Stage 



Source 



P6176 



Tumor 



Papillary serous cyst carcinoma 



80 



III 



Asterand 



p6177 



Normal 



p6178 



Tumor 



Serous cyst carcinoma 



Asterand 



70 



Not 
Available 



Asterand 



p6179 



Normal 



Asterand 



p6180 



Tumor 



Serous cyst carcinoma 



70 



Not 
Available 



Asterand 



p6181 
p6182 



Normal 



Asterand 



Tumor 
Normal 



Serous cyst carcinoma 



70 



INC 



Asterand 



p6183 
P6184 



Asterand 



Tumor 



Mature teratoma 



P6185 



70 



Normal 



III 



Asterand 



Asterand 



OVR1T 



Tumor 



OVR2T 
OVR5T 



Tumor 



Adenocarcinoma, serous papillary 



40 



IIIC 



Adenocarcinoma, serous papillary 



CHTN 



60 



IIIC 



CHTN 



Tumor 



OVR8T 



Tumor 



Adenocarcinoma, serous papillary 



80 



NIB 



OVR10T 



Tumor 
Tumor 



Adenocarcinoma, serous papillary 



CHTN 



80 



Adenocarcinoma, serous papillary 



IVA 



CHTN 



40 



IVA 



CHTN 



OVR11T 



OVR12T 



Tumor 



Adenocarcinoma, serous papillary 



40 



IIIC 



OVR13T 



Tumor 



Adenocarcinoma, serous papillary 



CHTN 



50 



OVR16T 



Tumor 



Adenocarcinoma, serous papillary 



IIIC 



CHTN 



90 



OVR19T 



Tumor 



Adenocarcinoma, serous papillary 



40 



IIIC 
IIIC 



CHTN 



OVR22T 



Tumor 
Tumor 



Adenocarcinoma, serous papillary 



CHTN 



30 



IIIC 



Adenocarcinoma, serous papillary 



CHTN 



40 



IIIC 



CHTN 



OVR26T 



OVR27T 
OVR28T 



Tumor 
Tumor 



Adenocarcinoma, serous papillary 



60 



IIIC 



Adenocarcinoma, serous papillary 



CHTN 



40 



IIIC 



CHTN 



Adenocarcinoma, serous papillary 



80 



IIIC 



CHTN 



OVR102N 



Normal 



BioChain 
Institute 



OVR278EN 



Normal 



Enriched for epithelium 



BioChain 
Institute 



OVR278SN 



Normal 



Enriched for stroma 



BioChain 
Institute 



HUOVR 



Normal 



Note: Paired samples (Normal and Tumor adjacent tissue) obtained from 
together. Stages of ovarian cancers are indicated using the FIGO staging 



BioChain 
Institute 



the same patient are boxed 
system. 



RNA Exp ression Pmfillnq 

[74] Total RNA was extracted from each biopsy and processed as previously described. 
RNA extraction techniques are well-known to those of skill in the art. All samples profiled 
were processed using the Affymetrix GENECHIP™ system as recommended by Affymetrix 
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(GeneChip Expression Analysis Technical Manual, rev. 1, July 2001). Concentration and 

total amount of RNA and cRNA were estimated by measuring the samples at 260 nM and 

280 nM wavelengths using a Beckman-Coulter DU 650 spectrophotometer after a 1:50 

dilution of the samples (see Table 2). The type of array used for this study was the Human 

Genome U95Av2 (http://www.affvmetrix.cQm/p roducts/arravs/sDecific/hniig.qflffv) 
Analytical Strategy 

[75] Analysis of the expression profiles was performed in several steps described below. 
Selection ofmicroarray data of highest quality 

[76] We used for our analysis only microarrays for which the scaling factor was lower 
than 6, and where more than 30% of the probe sets were called "Present" by the Affymetrix 
MAS 4.0 algorithm. 
Selection of a subset of probe sets 

[77] Expression data were directly imported into the GENE SPRING® program (Silicon 
Genetics, Redwood City, CA) from the database. Genes expressed in only a few samples 
were eliminated; out of the 12,627 probe sets on the microarray, only those with an AvgDiff of 
at least 1 00 in 1 0% of the samples or more were used for further analysis. A clustering 
experiment was performed to visualize the different gene expression profiles of Normal and 
Tumor biopsies. 

[78] Further filtering was accomplished by eliminating probe sets of low quality or very 
low intensity signals in both groups of samples (Group 1 : Normal biopsies; Group 2: Tumor 
biopsies). Probe sets not called "Present" (P) in at least 75% of the samples in one of the 
two groups were not used for further analysis. In addition, AvgDiff values lower than 20 were 
all converted to a value of 20. 
Focus on the most differ entiallv-ey pr essed genes 

[79] Selection of genes differentially-expressed between the two groups of samples was 
done in 2 steps: 

1 . The AvgDiff of each probe set was compared between the 2 groups of samples by 
a non-parametric one-way ANOVA test, using SAS 8.2. 

2. The AvgDiff of each probe sets with p<0.05 was then correlated with the group of 
samples (Normal or Tumor). Probe sets were ranked from highest absolute PCC to 
lowest (calculated in Microsoft Excel). 
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Re-classification of samples 

[80] We used the "leave-one-out" analytical strategy previously described to determine 
the optimal number of probe sets that distinguished an ovarian tumor from a normal ovarian 
tissue. See van't Veer et al. (2002), supra. 

[81] For every sample left-out, we determined the average AvgDiff of each probe set in 
each group of samples (Groups 1 and 2). PCCs between the expression profile of the left- 
out sample and the average profile of each group were calculated for each probe set The 
effectiveness of each probe set in distinguishing a tumor from a normal ovarian tissue was 
evaluated by re-classifying each sample as Normal or Tumor based on the higher of the two 
CCs. 

[82] We determined the number of reclassified samples when using increasingly larger 
sets of genes (starting with 5). As used herein, a "false Neg" is defined as a Tumor 
incorrectly classified as a Normal ovary tissue, and inversely, a "false Pos" is defined as a 
Normal tissue incorrectly classified as a Tumor. 

[83] The probe sets that most-effectively distinguished tumor from normal ovarian tissue 
were then tested in their ability to classify gene expression profiles of a different set of 
ovarian tissues (Normal and Tumor) generated at GNF (see Table 1). 
Statistical d etermination nf OP 

[84] Using the desired threshold correlation value, a 2 x 2 table was constructed 
indicating the number of biopsies correctly and incorrectly identified as Normal or Tumor. 
ORs, along with 9 5 o/ 0 CIs, were calculated using SAS version 8.2. Statistical significance 
was determined using a Fisher's exact test with p-value cut-off of 0.05. 
Genes 

[85] The link between a probe set name and a GenBank Accession Number was 
provided by Affymetrix, together with a short gene description. We complemented and 
updated this description by a search of the NCBI databases, mainly LocusLink 
(http://www.ncbi.nim.nih gn v/LocusLink/inrt^v ht^n , qMIM 
(http://www.ncbi.nlm.nih n nv/ Q m i m / S6arfthnmim html> and PubMed 
(http://www.ncbi.nlm.nih goy/entre?/^ i^ry fr^i) 

Results 

RNA Expre ssion Profiling 

[86] Eighteen out of the 20 biopsies yielded more than the 5 mg of purified total RNA 
necessary to process the samples further (see Table 2). Sample P 6175. from which less 
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than 1 mg of purified total RNA was obtained, was the smallest sample (38 mg). The quality 
of the RNA was assessed by electrophoresis on a 1% agarose gel. The absence of both 
28S and 18S ribosomal RNA bands was observed for samples p6169 and p6180, Indicating 
some RNA degradation. For microarray hybridization, a maximum of 15 mg of cRNA was 
used when available, but no less than 12 mg. Enough cRNA was available for 21 samples 
(see Table 2). 
Quality Assessment 

[87] The data from the 21 arrays hybridized in PG (this study) and from the 18 hybridized 
previously at GNF were checked for quality (see Table 3). See Welsh et al. (2001 ), supra. 
All but 3 (p6169, P 6180 and p6185) passed our criteria of a scaling factor lower than 6, with 
more than 30 % of probe sets called "P" (see Table 3). The 36 remaining expression profiles 
were separated into 2 sets: a test set of 18 profiles generated in PG consisting of data from 
8 Normal art 10 Tumor biopsies, and a validation set of 18 profiles previously generated at 
GNF from 4 Normal and 14 Tumor biopsies. See Welsh et al. (2001), supra. 

Analysis 

Clustering Analysis 

[88] Expression data of the 1 8 samples of the test set were imported into the 
GENESPRING® software. Out of the 12,627 probe sets on the Affymetrix U95A microarray, 
2,174 had an AvgDiff of at least 100 in 2 or more of these 18 samples and were used for 
clustering analysis. The resulting clustering tree of samples and probe sets is shown in 
Fig. 1 . Interestingly, the dendogram of experiments contains two main branches 
corresponding to the two groups of samples, Normal (top) and Tumor (bottom) biopsies. The 
vast majority (>90%) of genes examined have an overall higher expression in the Normal 
ovarian tissues. The dendogram of probe sets shows only a small cluster of genes with 
higher expression in the Tumor tissues (left part). 
Selection of the most cii fferentiallv-fiXDressed g enest 

[89] Out of the 2,174 probe sets, 217 were excluded from further analysis because they 
provided large number of "A" or "marginal" calls (>75% in both groups). 
[90] Data for the remaining 1 ,957 probe sets were exported into SAS version 8.2 for non- 
parametric one-way ANOVA testing between the Normal and the Tumor groups. A total of 
900 probe sets had AvgDiff values significantly different between the two groups (p<0.05). 
These genes are listed in Table 9. 
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[91] The AvgDiff of these 900 probe sets was then correlated with the two groups of 
samples (Group 1: Normal; Group 2: Tumor). The absolute PCC (R)-values ranged from 
0.042-0.877, with 694 probe sets (77%) with a R-value higher than 0.5. The AvgDiff data of 
the 900 probe sets ranked from highest absolute PCC to lowest are available in Appendix 1 . 
Leave-one-out method an d re-classification gfsam£ les 

[92] The "leave-one-out" analytical strategy previously described was applied to the 
18 ovarian samples for the expression of the 900 selected probe sets. See van't Veer et al 
(2002), supra. 

[93] The number of misclassified samples when using the first 5 probe sets was 6 (2 
false Pos and 4 false Neg). Increasingly large sets of genes were used. The number of 
reclassifications varied between 2 and 7, with the minimum achieved when using the first 
28 probe sets (Fig. 1). These first 28 probe sets displayed only one false Pos and one false 
Neg. Interestingly, perfect classification of Normal biopsies (0 false Pos) was achieved with 
the first 32 probe sets (which also detected 2 false Neg), while perfect classification of Tumor 
biopsies (0 false Neg) was never seen. 
Optimal classification sat and correlation threshold uain*« 

[94] We determined the mean Normal (No Tumor) biopsy profile for the classification 
probe sets, to be used as a reference for analysis of biopsies of unknown or questionable 
status; we expected that tumor heterogeneity may not allow the determination of a reference 
Tumor profile. We examined the classification value of the first 28 probe sets, by comparing 
their expression for each of the 18 samples to the mean Normal profile calculated using all 
8 Normal biopsy profiles. Samples were then ranked by correlation values from highest to 
lowest and error rates were determined as a function of where the threshold correlation was 
drawn. The results are displayed in Fig. 2. The minimum number of incorrectly assigned 
samples was 2 [1 false Pos (p6177) and 1 false Neg (p6168)]. The corresponding CC value 
was between 0.920 and 0.921 . The OR and Fisher's exact test were performed based on 
the number of samples correctly and incorrectly predicted to be Normal or Tumor. The 
difference between the observed and expected biopsy status was significant: OR * 63- 95% 
CI: 3.3-1 1 94.7, p=0.0029. The OR indicates that an ovarian biopsy of the test set is nearly 
63 fames more likely to be from an ovarian tumor if its expression profile of the 28 predictor 
probe sets correlates with the mean Normal profile with a CC S0.920 (see Table 4) In our 
test set, 100% of profiles with a CC >0.955 correspond to Normal biopsies and 100% of 
profiles with a CC <0.870 correspond to Tumor biopsies (see Fig. 3). 
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Validation of the mn an Normal pmfiio 

[95] The 28 probe sets selected by the leave-one-out method allowed us to distinguish 
Norma, from Tumor ovarian biopsies in our series of 18 ovarian samples. We then tested if 
.ndependent ovarian biopsies could be correctiy classified by comparing their expression 
profile to the same mean Normal profile of the 28 classification probe sets. 
[96] Fig. 3 summarizes the classification of all ovarian biopsies based on the correlation 
of 28 probe sets. Remarkably, the profiles of the Normal and Tumor samples of the 
validation set were clearly separated from each other (see Fig. 3). As in the test set, 100% of 
profiles with a CC <0.870 correspond to Tumor biopsies, .nterestingly, Norma, profiles had 
lower CCs than in the test set, and the threshold correlation value that best separate Normal 
and Tumor biopsies in the validation set lies between 0.762 and 0.876. 
[97] We performed a non-parametric t-test between the average Normal profile of the 
test set and the average Normal profile of the validation set Similarly, we compared the 
average Tumor profiles of both sets. Since no statistical difference was observed (p=0 373 
and p=0.1 10, respectively), we combined both sets to increase the classification value of the 
28 probe sets. We compared the expression for all the samples to the mean Normal profile 
calculated using all 12 Normal biopsy profiles (8 from the test set and 4 from the validation 
test). Results confirm that correlation values provide highly-significant separation of the 
Normal biopsy from the Tumor biopsy profiles (see Fig. 4 and Table 5). As seen previously 
the profile of Tumor sample (p6168) had a high correlation with the average Normal profile ' 
Correlation betwee n individua l n ana exomxsinn ^ bionftv status 
[98] The selection of probe sets for the classification of ovarian biopsies was originally 
done based on the profile of the 18 test samples. The good separation of all 36 Norma, and 
Tumor samples (see Fig. 4) with the same probe sets suggested that the genes selected by 
our method are differentially-expressed in many other ovarian tumors. However, because of 
tumor heterogeneity, the difference in individual gene expression is likely to vary with the 
samples analyzed. We eva.uated to what extent the 900 probe sets differentially-expressed 
in the 18 test samples, were also differentially-expressed when all 36 biopsies were 
analyzed. 

[99] Probe sets were ranked from highest absolute PCC to lowest, first using the 
18 samples from the test set, and then with all 36 samples from both the test set and the 
valuation set. From the 900 probe sets selected, 694 and 473 had an absolute CC higher 
than 0.5 with the 18 and 36 samples, respectively; 412 probe sets had a coefficient higher 
than 0.5 ,n both cases. Interestingly, from the 28 probe sets originally selected for the biopsy 
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classification, 1 9 ranked in the top 100; the other 9 probe sets had correlation values ranging 
from 0.359-0.703. 

Genes differentially-expressed betwee n Normal and Tumor ovarian Nn n*ia* 
Genes up-r egulated in ovarian tumors 

[1 00] Among the genes differentially expressed between Normal and Tumor ovarian 
biopsies, we detected a few genes already known to be up-regulated in ovarian tumors, such 
as the genes coding for Claudin 4, topoisomerase II alpha, Kallikrein 8, osteopontin, as well 
as potential new markers of ovarian cancers (see Table 6). 

[101] Claudin 4, a component of tight Junctions, has been shown to be up-regulated in 
ovarian tumors together with another member of this family of transmembrane receptors, 
Claudin 3. See Hough et al.. Cancer Res., Vol. 60, No. 22, pp. 6281-6287 (2000). Costa 
and colleagues have reported that levels of topoisomerase II alpha correlate with poor 
prognosis of ovarian surface epithelial neoplasms. Kallikrein 8 has been detected by 
immunohistochemistry in carcinoma but not Normal ovarian tissue and was suggested as a 
prognostic marker of ovarian cancer. See Underwood et al., Cancer Res., Vol. 59, No. 17, 
pp. 4435-4439 (1999); and Magklara et al., Clin. Cancer Res., Vol. 7, No. 4, pp. 806-811 
(2001 ). Osteopontin has also been previously proposed as a diagnostic biomarker for 
ovarian cancer. See Kim et al., JAMA, Vol. 287, No. 13, pp. 1671-1679 (2002). Another 
gene, C20ORF1, has been shown to be expressed in lung carcinoma cell lines but not in 
normal lung tissues. See Manda et al., Genomics, Vol. 61, No. 1, pp. 5-14 (1999). Other 
genes that may have be over-expressed in only some of the biopsies due to the tumor type, 
the disease stage or other tumor specificity, were not detected by our analytical method. 
Genes do wn-regulated in ovarian tumnr* 

[1 02] We further examined a large number of genes down-regulated in the ovarian tumor 
biopsies profiled. For analysis purpose, we classified the 28 probe sets and the top 100 
down-regulated genes in 8 categories based on the known or suspected function of their 
product (see Tables 7 and 8). Interestingly, the function of nearly 30% of these 100 genes is 
still unknown. Most of the other genes play a role in, or are already suspected to be involved 
in transcription regulation (16 genes), in cell cycle regulation, growth differentiation, cell death 
or tumor suppression (12 genes) and signal transduction (6 genes). This list includes several 
potential tumor suppressors: the gene coding for the transforming growth factor 
beta receptor III (TGFPR3), a platelet-derived growth factor receptor-like gene (PDGFRL), 
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the suppression of tumorigenicity (ST13) gene, a gene coding for a reversion-inducing- 
cysteine-rich protein with kazal motif (RECK) and the paternally expressed 3 (PEG3) gene. 
[103] This observation suggests that the genes with still unidentified function are likely to 
be involved in cell cycle regulation, growth differentiation, signal transduction or transcription 
regulation. Some of them may act as tumor suppressors. Down-regulation in ovarian tumor 
or cell lines had been reported for just one of these genes, IGFBP5 which in our study was 
detected with 2 separate probe sets (see Table 8). See Welsh et al. (2001 ), supra. 
[104] Only 6 genes coding for proteins of the extracellular matrix were noticed including 
laminin alpha 2 (LAMo2). Yang and colleagues have reported that transient loss of LAMa2 
in the basement membrane of the pre-malignant epithelium and subsequent inactivation of 
Dab2 are common early event associated with tumorigenicity of the ovarian surface 
epithelium. See Yang etal., Cancer, Vol. 94, No. 9, pp. 2380-2392 (2002). Interestingly, 
down-regulation of Dab2 (probe set 479_at) was also observed in our study with a CC value 
of 0.49. 

[105] Taken together, these results indicated that most of the genes with a statistically 
significant decreased expression in the ovarian biopsies, are indeed involved in the 
development or progression of the tumors rather than detected because of a change in cell 
population or tissue organization, e.g., loss of connective tissue and fat cells. 

Discussion 

[1 06] The filtering and analytical methods that we used here, provided a list of genes 
differentially expressed between Normal and Tumor ovarian samples. We showed that a 
small subset (28-42 probe sets) is sufficient to accurately classify ovarian biopsies as Normal 
or Tumor based on their expression profiles. Validation of this expression signature was 
done on different biopsies profiled in an independent laboratory, and confirms that the 
difference in expression observed between the Normal or Tumor samples reflects a 
biological process rather than of a laboratory or analytical error. 
[1 07] Several factors not examined here that may affect the detection of differentially 
expressed genes include the number of samples in the test set, and the heterogeneity of the 
samples studied. Indeed, it is expected that biopsies and, in particular, Tumor biopsies, have 
a substantial level of heterogeneity: tumor type, grade, percentage of tumor cells, presence 
of connective and fat tissues, etc. We studied different types of ovarian tumors of various 
grades (see Table 1) to search for genes involved in common pathways of tumor 
development and progression, rather than genes involved more specifically in certain types 
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of tumors as previously reported. See Ono et al. (2000), supra; and Welsh et al. (2001), 
supra. 

[1 08] Our clustering analysis of the biopsy expression profiles, revealed that the vast 
majority of genes that differentiate Normal and Tumor samples were down-regulated in the 
tumors. Indeed, when the 900 most differentially-expressed probes were ranked based on 
the CC between their expression in all 36 biopsies and the Normal and Tumor status, the top 
220 probes (R from 0.865-0.644) were down-regulated in the tumors. We examined more 
closely the function of the top 100 genes (R from 0.865-0.72), and the top 10 genes over- 
expressed in the tumors (R from 0.643-0.443). Among the most differentially-expressed 
genes, we detected several genes already known to be up-regulated in ovarian tumors, as 
well as potential new markers of ovarian cancers. However, most of the genes were down- 
regulated, most likely because we studied various types of late stage tumors of different 
origins, different grades and different tumor cell content. The involvement of many of these 
genes in transcription regulation, in cell cycle regulation, growth differentiation, signal 
transduction, cell death or tumor suppression underscores the need to further evaluate their 
role in ovarian cancer. The list of other genes of still unknown function points to novel 
potential players in tumor development and progression. 
Methods of Modifying RNA Abundances orAnth/itte* 

[1 09] Methods of modifying RNA abundances and activities currently fall within three 
classes: ribozymes, antisense species and RNA aptamers. See Good et al., Gene Then, 
Vol. 4, No. 1 , pp. 45-54 (1 997). Controllable application or exposure of a cell to these 
entities permits controllable perturbation of RNA abundances. 
Ribozvmes 

[1 1 0] Ribozymes are RNAs which are capable of catalyzing RNA cleavage reactions. See 
Cech, Science, Vol. 236, pp. 1532-1539 (1987); PCT International Publication WO 90/1 1364 
(1990); Sarver et al., Science, Vol. 247, pp. 1222-1225 (1990). "Hairpin" and "hammerhead" 
RNA ribozymes can be designed to specifically cleave a particular target mRNA. Rules have 
been established for the design of short RNA molecules with ribozyme activity, which are 
capable of cleaving other RNA molecules in a highly sequence specific way and can be 
targeted to virtually all kinds of RNA. See Haseloff et al., Nature, Vol. 334, pp. 585-591 
(1988); Koizumi et al., FEBS Lett., Vol. 228, pp. 228-230 (1988); and Koizumi et al., 
FEBS Lett., Vol. 239. pp. 285-288 (1988). Ribozyme methods involve exposing a cell to, 
inducing expression in a cell. etc. of such small RNA ribozyme molecules. See Grassi and 
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Marini, Annals of Med., Vol. 28, No. 6, pp. 499-510 (1996); and Gibson, Cancer Meta Rev. 
Vol. 1 5. pp. 287-299 (1 996). 

[1 1 1] Ribozymes can be routinely expressed in vivo in sufficient number to be catalytically 

effective in cleaving mRNA, and thereby modifying mRNA abundances in a cell See 

Cotton et al., EMBO J., Vol. 8, pp. 3861-3866 (1989). In particular, a ribozyme coding DNA 

sequence, designed according to the previous rules and synthesized, e.g., by standard 

phosphoramidite chemistry, can be ligated into a restriction enzyme site in the anticodon 

stem and loop of a gene encoding a tRNA, which can then be transformed into and 

expressed in a cell of interest by methods routine in the art. Preferably, an inducible 

promoter, e.g., a glucocorticoid or a tetracycline esponse element, is also introduced into this 

construct so that ribozyme expression can be selectively controlled. For saturating use a 

h.ghly and constituently active promoter can be used. tDNA genes, i.e., genes encoding 

tRNAs, are useful in this application because of their small size, high rate of transcription and 

ubiquitous expression in different kinds of tissues. Therefore, ribozymes can be routinely 

des,gned to cleave virtually any mRNA sequence, and a cell can be routinely transformed 

with DNA coding for such ribozyme sequences such that a controllable and catalytically 

effective amount of the ribozyme is expressed. Accordingly, the abundance of virtually any 

RNA species in a cell can be modified or perturbed. 
Antisense molecules 

[1 1 2] In another embodiment, activity of a target RNA (preferably mRNA) species 
specifically its rate of translation, can be controllably inhibited by the controllable application 
of antisense nucleic acids. Application at high levels results in a saturating inhibition An 
"antrsense" nucleic acid as used herein refers to a nucleic acid capable of hybridizing to a 
sequence-specific, e.g., non-poly A, portion of the target RNA, e.g., its translation initiation 
reg.on, by virtue of some sequence complementary to a coding and/or non-coding region 
The antisense nucleic acids of the invention can be oligonucleotides that are double- 
stranded or single-stranded, RNA or DNA or a modification or derivative thereof, which can 
be d.rectly administered in a controllable manner to a cell or which can be produced 
intracellular* by transcription of exogenous, introduced sequences in controllable quantities 
sufficient to perturb translation of the target RNA. 

[113] Preferably, antisense nucleic acids are of at least six nucleotides and are preferably 
ol.gonucleotides, ranging from 6 oligonucleotides to about 200 oligonucleotides. In specific 
aspects, the oligonucleotide is at least 10 nucleotides, at least 15 nucleotides at least 
1 00 nucleotides or at least 200 nucleotides. The oligonucleotides can be DNA or RNA or 



WO 2005/005661 



PCT/EP2004/007167 



26 



chimeric mixtures or derivatives or modified versions thereof, single-stranded or double- 
stranded. The oligonucleotide can be modified at the base moiety, sugar moiety or 
phosphate backbone. The oligonucleotide may include other appending groups, such as 
peptides, or agents facilitating transport across the cell membrane [see, e.g., Letsinger et al.. 
Proc. Natl. Acad. Sci. USA, Vol. 86, pp. 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. ' 
Sci. USA, Vol. 84, pp. 648-652 (1987); and PCT Publication No. WO 88/09810 (1988)], 
hybridization-triggered cleavage agents [see, e.g., Krol et al., BloTechniques, Vol. 6, pp. 958- 
976 (1988)] or intercalating agents [see, e.g., Zon, Pharm. Res., Vol. 5, No. 9, pp 539-549 
(1988)]. 

[114] In a preferred aspect of the invention, an antisense oligonucleotide is provided, 
preferably as single-stranded DNA. The oligonucleotide may be modified at any position on 
its structure with constituents generally known in the art. 

[115] The antisense oligonucleotides may comprise at least one modified base moiety 
which is selected from the group including, but not limited to, 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine. 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, p-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
B-D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- ' 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v). 5-methyl-2-thiouracil. 3-(3-amino-3-/V- 
2-carboxypropyl) uracil, (acp3)w and 2,6-diaminopurine. 

[1 16] In another embodiment, the oligonucleotide comprises at least one modified sugar 
moiety selected from the group including, but not limited to, arabinose, 2-fluoroarabinose, 
xylulose and hexose. 

[1 17] In yet another embodiment, the oligonucleotide comprises at least one modified 
phosphate backbone selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate. a phosphord'iamidate, a 
methylphosphonate, an alkyl phosphotriester and a formacetal or analog thereof. 
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[118] In yet another embodiment, the oligonucleotide is a 2-a-anomeric oligonucleotide. 
An a-anomeric oligonucleotide forms specific double-stranded hybrids with complementary 
RNA in which, contrary to the usual B-units, the strands run parallel to each other. See 
Gautier et al., Nucl. Acids Res., Vol. 15, pp. 6625-6641 (1987). 
[1 1 9] The oligonucleotide may be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered cleavage 
agent, etc. 

[1 20] The antisense nucleic acids of the invention comprise a sequence complementary to 
at least a portion of a target RNA species. However, absolute complementary, although 
preferred, is not required. A sequence "complementary to at least a portion of an RNA", as 
referred to herein, means a sequence having sufficient complementary to be able to 
hybridize with the RNA, forming a stable duplex; in the case of double-stranded antisense 
nucleic acids, a single strand of the duplex DNA may thus be tested or triplex formation may 
be assayed. The ability to hybridize will depend on both the degree of complementary and 
the length of the antisense nucleic acid. Generally, the longer the hybridizing nucleic acid, 
the more base mismatches with a target RNA it may contain and still form a stable duplex (or 
triplex, as the case may be). One skilled in the art can ascertain a tolerable degree of 
mismatch by use of standard procedures to determine the melting point of the hybridized 
complex. The amount of antisense nucleic acid that will be effective in the inhibiting 
translation of the target RNA can be determined by standard assay techniques. 
[1 21] Oligonucleotides of the invention may be synthesized by standard methods known 
in the art, e.g., by use of an automated DNA synthesizer, such as are commercially-available 
from Biosearch, Applied Biosystems, etc. As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al., Nucl. Acids Res., Vol. 16, p. 3209 (1988), 
methylphosphonate oligonucleotides can be prepared by use of controlled pore glass 
polymer supports, etc. See Sarin et al., Proc. Natl. Acad. Sci. USA, Vol. 85, pp. 7448-7451 
(1988). In another embodiment, the oligonucleotide is a 2 , -0-methylribonucleotide [see Inoue 
et al., Nucl. Acids Res., Vol. 15, pp. 6131-6148 (1 987)] or a chimeric RNA-DNA analog [see 
Inoue et al., FEBS Lett., Vol. 215, pp. 327-330 (1987)]. 

[1 22] The synthesized antisense oligonucleotides can then be administered to a cell in a 
controlled or saturating manner. For example, the antisense oligonucleotides can be placed 
in the growth environment of the cell at controlled levels where they may be taken up by the 
cell. The uptake of the antisense oligonucleotides can be assisted by use of methods well- 
known in the art. 
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Antisense Molecules Expressed Intracellularly 

[123] In an alternative embodiment, the antisense nucleic acids of the invention are 
controllably expressed intracellularly by transcription from an exogenous sequence. If the 
expression is controlled to be at a high level, a saturating perturbation or modification results. 
For example, a vector can be introduced in vivo such that it is taken up by a cell, within which 
cell the vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) 
of the invention. Such a vector would contain a sequence encoding the antisense nucleic 
acid. Such a vector can remain episomal or become chromosomally integrated, as long as it 
can be transcribed to produce the desired antisense RNA. Such vectors can be constructed 
by recombinant DNA technology methods standard in the art. Vectors can be plasmid, viral 
or others known in the art, used for replication and expression in mammalian cells. 
Expression of the sequences encoding the antisense RNAs can be by any promoter known 
in the art to act in a cell of interest. Such promoters can be inducible or constitutive. Most 
preferably, promoters are controllable or inducible by the administration of an exogenous 
moiety in order to achieve controlled expression of the antisense oligonucleotide. Such 
controllable promoters include the Tet promoter. Other usable promoters for mammalian 
cells include, but are not limited to, the SV40 early promoter region [see Bernoist and 
Chambon, Nature, Vol. 290, pp. 304-310 (1981)], the promoter contained in the 3' long 
terminal repeat of Rous sarcoma virus [see Yamamoto et al., Cell, Vol. 22, pp. 787-797 
(1980)], the herpes thymidine kinase promoter [see Wagner et al., Proc. Natl. Acad. Sci. 
USA, Vol. 78, pp. 1441-1445 (1981)], the regulatory sequences of the metallothionein gene, 
etc. [see Brinster et al. , Nature, Vol. 296, pp. 39-42 (1 982)]. 

[1 24] Therefore, antisense nucleic acids can be routinely designed to target virtually any 
mRNA sequence, and a cell can be routinely transformed with or exposed to nucleic acids 
coding for such antisense sequences such that an effective and controllable or saturating 
amount of the antisense nucleic acid is expressed. Accordingly the translation of virtually 
any RNA species in a cell can be modified or perturbed. 
RNA Aptamers 

[1 25] Finally, in a further embodiment, RNA aptamers can be introduced into or expressed 
in a cell. RNA aptamers are specific RNA ligands for proteins, such as for Tat and Rev RNA 
[see Good et al. (1997), supra] that can specifically inhibit their translation. 
Methods of Modifying Protein Abundances 

[1 26] Methods of modifying protein abundances include, inter alia, those altering protein 
degradation rates and those using antibodies, which bind to proteins affecting abundances of 
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activities of native target protein species. Increasing (or decreasing) the degradation rates of 
a protein species decreases (or increases) the abundance of that species. Methods for 
increasing the degradation rate of a target protein in response to elevated temperature 
and/or exposure to a particular drug, which are known in the art* can be employed in this 
invention. For example, one such method employs a heat-inducible or drug-induclble 
/V-terminal degron, which is an /V-terminal protein fragment that exposes a degradation signal 
promoting rapid protein degradation at a higher temperature, e.g., 37'C, and which is hidden 
to prevent rapid degradation at a lower temperature, e.g., 23°C. See Dohmen et al., 
Science, Vol. 263, pp. 1273-1276 (1994). Such an exemplary degron is Arg-DHFR* a 
variant of murine dihydrofolate reductase in which the /V-terminal Val is replaced by Arg and 
the Pro at position 66 is replaced with Leu. According to this method, e.g., a gene for a 
target protein, P, is replaced by standard gene targeting methods known in the art [see 
Lodish et al., Molecular Biology of the Cell, W.H. Freeman and Co., NY, especially Chapter 8 
(1995)] with a gene coding for the fusion protein Ub-Arg-DHFR te -P (W stands for ubiquitin). 
The /V-terminal ubiquitin is rapidly cleaved after translation exposing the /v-terminal degron. 
At lower temperatures, lysines internal to Arg-DHFR* are not exposed, ubiquitination of the 
fusion protein does not occur, degradation is slow and active target protein levels are high. 
At higher temperatures (in the absence of methotrexate), lysines internal to Arg-DHFR* are 
exposed, ubiquitination of the fusion protein occurs, degradation is rapid and active target 
protein levels are low. This technique also permits controllable modification of degradation 
rates since heat activation of degradation is controllably blocked by exposure methotrexate. 
This method is adaptable to other /V-terminal degrons which are responsive to other inducing 
factors, such as drugs and temperature changes. 
Modifying Protein Activi ty With AntihnHi** 

[127] Target protein activities can also be decreased by (neutralizing) antibodies. By 
providing for controlled or saturating exposure to such antibodies, protein 
abundances/activities can be modified or perturbed in a controlled or saturating manner. For 
example, antibodies to suitable epitopes on protein surfaces may decrease the abundance 
and thereby indirectly decrease the activity, of the wild-type active form of a target protein by 
aggregating active forms into complexes with less or minimal activity as compared to the 
wild-type unaggregated wild-type form. Alternately, antibodies may directly decrease protein 
activity by, e.g., interacting directly with active sites or by blocking access of substrates to 
active sites. Conversely, in certain cases, (activating) antibodies may also interact with 
proteins and their active sites to increase resulting activity. In either case, antibodies (of the 
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various types to be described) can be raised against specific protein species (by the methods 
to be described) and their effects screened. The effects of the antibodies can be assayed 
and suitable antibodies selected that raise or lower the target protein species concentration 
and/or activity. Such assays involve introducing antibodies into a cell (see below) and 
assaying the concentration of the wild-type amount or activities of the target protein by 
standard means, such as immunoassays, known in the art. The net activity of the wild-type 
form can be assayed by assay means appropriate to the known activity of the target protein 
[128] Antibodies can be introduced into cells in numerous fashions, including, e g 
microinjection of antibodies into a cell [see Morgan et al., Immunol. Today, Vol. 9, pp ' 84-86 
(1988)] or transforming hybridoma mRNA encoding a desired antibody into a cell [see Burke 
et al., Cell, Vol. 36, pp. 847-858 (1984)]. In a further technique, recombinant antibodies can 
be eng.neering and ectopically expressed in a wide variety of non-lymphoid cell types to bind 
to target proteins, as well as to block target protein activities. See Biocca et al., Trends Cell 
Biol., Vol. 5, pp. 248-252 (1995). Expression of the antibody is preferably under control of a 
controllable promoter, such as the Tet promoter, or a constitutively active promoter (for 
production of saturating perturbations). A first step is the selection of a particular monoclonal 
antibody with appropriate specificity to the target protein (see below). Then sequences 
encoding the variable regions of the selected antibody can be cloned into various engineered 
antibody formats, including, e.g., whole antibody, Fab fragments. Fv fragments, single-chain 
Fv (ScFv) fragments (V H and V L regions united by a peptide linker), diabodies (two 
associated ScFv fragments with different specificities) and so forth. See Hayden et al Curr. 
Op,n. Immunol., Vol. 9, pp. 210-212 (1997). Intracellularly-expressed antibodies of the' 
various formats can be targeted into cellular compartments, e.g., the cytoplasm, the nudeus 
the mrtochondria, etc., by expressing them as fusions with the various known intracellular 
leader sequences. See Bradbury et al., Antibody Engineering Borrebaeck. Editor. Vol 2 
pp. 295-361. IRL Press (1995). In particular, the ScFv format appears to be particularly ' 
suitable for cytoplasmic targeting. 

[129] Antibody types include, but are not limited to. polyclonal, monoclonal, chimeric 
s.ngle-chain, Fab fragments and an Fab expression library. Various procedures known'in the 
art may be used for the production of polyclonal antibodies to a target protein For 
production of the antibody, various host animals can be immunized by injection with the 
target protein, such host animals include, but are not limited to, rabbits, mice rats etc 
Various adjuvants can be used to increase the immunological response, depending on the 
host species and include, but are not limited to, Freund's (complete and incomplete); mineral 
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gels, such as aluminum hydroxide; surface active substances, such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, dinitrophenol; and potentially useful human 
adjuvants, such as Bacillus Calmette-Guerin (BCG) and corynebacterium parvum. 
[1 30] For preparation of monoclonal antibodies directed towards a target protein, any 
technique that provides for the production of antibody molecules by continuous cell lines in 
culture may be used. Such techniques include, but are not restricted to, the hybridoma 
technique originally developed by Kohler and Milstein, Nature, Vol. 256. pp. 496-497 (1975), 
the trioma technique, the human B-cell hybridoma technique [see Kozbor et al., Immunol. 
Today, Vol. 4. p. 72 (1983)] and the EBV hybridoma technique to produce human 
monoclonal antibodies [see Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. 
Liss, Inc., pp. 77-96 (1 985)]. In an additional embodiment of the invention, monoclonal 
antibodies can be produced in germ-free animals utilizing recent technology. See 
PCT/US90/02545. According to the invention, human antibodies may be used and can be 
obtained by using human hybridomas [see Cote et al., Proc. Natl. Acad. Sci. USA, Vol. 80, 
pp. 2026-2030 (1983)], or by transforming human B cells with EBV virus in vitro [see Cole et 
al. (1985), supra]. In fact, according to the invention, techniques developed for the 
production of "chimeric antibodies" [see Morrison et al., Proc. Natl. Acad. Sci. USA, Vol. 81, 
pp. 6851-6855 (1984); Neuberger et al., Nature, Vol. 312, pp. 604-608 (1984); Takeda et al. 
Nature, Vol. 314, pp. 452-454 (1985)] by splicing the genes from a mouse antibody molecule 
specific for the target protein together with genes from a human antibody molecule of 
appropriate biological activity can be used; such antibodies are within the scope of this 
invention. 

[131] Additionally, where monoclonal antibodies are advantageous, they can be 
alternatively selected from large antibody libraries using the techniques of phage display. 
See Marks et al., J. Biol. Chem., Vol. 267, pp. 16007-16010 (1992). Using this technique, 
libraries of up to 10 12 different antibodies have been expressed on the surface of fd 
filamentous phage, creating a "single pot" in vitro immune system of antibodies available for 
the selection of monoclonal antibodies. See Griffiths et al., EMBO J., Vol. 13, pp. 3245-3260 
(1994). Selection of antibodies from such libraries can be done by techniques known in the 
art, including contacting the phage to immobilized target protein, selecting and cloning phage 
bound to the target and subcloning the sequences encoding the antibody variable regions 
into an appropriate vector expressing a desired antibody format. 

[132] According to the invention, techniques described for the production of single-chain 
antibodies (see U.S. Patent No. 4,946.778) can be adapted to produce single-chain 
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antibodies specific to the target protein. An additional embodiment of the invention utilizes 
the techniques described for the construction of Fab expression libraries [see Huse et al 
Science, Vol. 246, pp. 1275-1281 (1989)] to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity for the target protein. 

[133] Antibody fragments that contain the idiotypes of the target protein can be generated 
by techniques known in the art. For example, such fragments include, but are not limited to 
the F(ab') 2 fragment which can be produced by pepsin digestion of the antibody molecule; 
the Fab' fragments that can be generated by reducing the disulfide bridges of the F(ab') 2 ' 
fragment, the Fab fragments that can be generated by treating the antibody molecule with 
papain and a reducing agent and Fv fragments. 

[1 34] In the production of antibodies, screening for the desired antibody can be 
accomplished by techniques known in the art, e.g., ELISA. To select antibodies specific to a 
target protein, one may assay generated hybridomas or a phage display antibody library for 
an antibody that binds to the target protein. 
Methods o f Modifying Protein Anth/Me* 

[135] Methods of directly modifying protein activities include, inter alia, dominant negative 
mutations, specific drugs or chemical moieties and also the use of antibodies, as previously 
discussed. J 

[1 36] Dominant negative mutations are mutations to endogenous genes or mutant 
exogenous genes that when expressed in a cell disrupt the activity of a targeted protein 
species. Depending on the structure and activity of the targeted protein, general rules exist 
that guide the selection of an appropriate strategy for constructing dominant negative 
mutations that disrupt activity of that target. See Hershkowitz, Nature, Vol. 329, pp. 21 9-222 
(1 987). In the case of active monomeric forms, over expression of an inactive form can 
cause competition for natural substrates or ligands sufficient to significantly reduce net 
activity of the target protein. Such over expression can be achieved by, e.g., associating a 
promoter, preferably a controllable or inducible promoter, or also a constitutively expressed 
promoter, of increased activity with the mutant gene. Alternatively, changes to active site 
residues can be made so that a virtually irreversible association occurs with the target Hgand 
Such can be achieved with certain tyrosine kinases by careful replacement of active site 
serine residues. See Perimutter et al., Cum Opin. Immunol., Vol. 8, pp. 285-290 (1996). 
[137] In the case of active multimeric forms, several strategies can guide selection of a 
dominant negative mutant. Multimeric activity can be decreased in a controlled or saturating 
manner by expression of genes coding exogenous protein fragments that bind to multimeric 



WO 2005/005661 



PCT/EP2004/007167 



33 



association domains and prevent multimer formation. Alternatively, controllable or saturating 
over-expression of an inactive protein unit of a particular type can tie up wild-type active units 
in inactive multimers, and thereby decrease multimeric activity. See Nocka et aL. EMBO J 
Vol. 9, pp. 1805-1813 (1990). For example, in the case of dimeric DNA binding proteins, the 
DNA binding domain can be deleted from the DNA binding unit, or the activation domain 
deleted from the activation unit. Also, in this case, the DNA binding domain unit can be 
expressed without the domain causing association with the activation unit. Thereby, DNA 
binding sites are tied up without any possible activation of expression. In the case where a 
particular type of unit normally undergoes a conformational change during activity, 
expression of a rigid unit can inactivate resultant complexes. For a further example, proteins 
involved in cellular mechanisms, such as cellular motility, the mitotic process, cellular 
architecture and so forth, are typically composed of associations of many subunits of a few 
types. These structures are often highly sensitive to disruption by inclusion of a few 
monomeric units with structural defects. Such mutant monomers disrupt the relevant protein 
activities and can be expressed in a cell in a controlled or saturating manner. 
[138] In addition to dominant negative mutations, mutant target proteins that are sensitive 
to temperature (or other exogenous factors) can be found by mutagenesis and screening 
procedures that are well-known in the art. 

[139] Also, one of skill in the art will appreciate that expression of antibodies binding and 
inhibiting a target protein can be employed as another dominant negative strategy. 
Modifying Proteins With Small iWofem/to r>m 7 o 

[140] Finally, activities of certain target proteins can be modified or perturbed in a 
controlled or a saturating manner by exposure to exogenous drugs or ligands. Since the 
methods of this invention are often applied to testing or confirming the usefulness of various 
drugs to treat cancer, drug exposure is an important method of modifying/perturbing cellular 
constituents, both mRNAs and expressed proteins. In a preferred embodiment, input cellular 
constituents are perturbed either by drug exposure or genetic manipulation, such as gene 
deletion or knockout; and system responses are measured by gene expression technologies 
such as hybridization to gene transcript arrays (described in the following). 
[141] in a preferable case, a drug is known that interacts with only one target protein in 
the cell and alters the activity of only that one target protein, either increasing or decreasing 
the activity. Graded exposure of a cell to varying amounts of that drug thereby causes 
graded perturbations of network models having that target protein as an input. Saturating 
exposure causes saturating modification/perturbation. For example, Cyclosporin A is a very 
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specific regulator of the calclneurln protein, acting via a complex with cyclophilin. A titration 
series of Cyclosporin A therefore can be used to generate any desired amount of inhibition of 
the calclneurln protein. Alternately, saturating exposure to Cyclosporin A will maximally 
inhibit the calcineurin protein. 
Measurement Methnris 

[142] The experimental methods of this invention depend on measurements of cellular 
constituents. The cellular constituents measured can be from any aspect of the biological 
state of a cell. They can be from the transcriptional state, in which RNA abundances are 
measured, the translation state, in which protein abundances are measured, the activity 
state, in which protein activities are measured. The cellular characteristics can also be from 
m.xed aspects, e.g., in which the activities of one or more proteins are measured along with 
the RNA abundances (gene expressions) of other cellular constituents. This section 
describes exemplary methods for measuring the cellular constituents in drug or pathway 
responses. This invention is adaptable to other methods of such measurement 
[143] Preferably, in this invention the transcriptional state of the other cellular constituents 
are measured. The transcriptional state can be measured by techniques of hybridization to 
arrays of nucleic acid or nucleic acid mimic probes, described in the next subsection or by 
other gene expression technologies, described in the subsequent subsection. However 
measured, the result is data including values representing mRNA abundance and/or ratios 
which usually reflect DMA expression ratios (in the absence of differences in RNA 
degradation rates). 

[144] In various alternative embodiments of the present invention, aspects of the 
biological state other than the transcriptional state, such as the translation^ state, the activity 
state or mixed aspects can be measured. 

[145] In all embodiments, measurements of the cellular constituents should be made in a 
manner that is relatively independent of when the measurement are made. 
Transcriptional State Me asurement 

[146] Preferably, measurement of the transcriptional state is made by hybridization to 
transcript arrays, which are described in this subsection. Certain other methods of 
transcriptional state measurement are described later in this subsection. 
Transcript Arrays Generally 

[147] In a preferred embodiment the present invention makes use of "transcript arrays" 
also called herein "microarrays". Transcript arrays can be employed for analyzing the 
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transcriptional state in a cell, and especially for measuring the transcriptional states of cancer 
cells. 

[148] In one embodiment, transcript arrays are produced by hybridizing detectably-labeled 
polynucleotides representing the mRNA transcripts present in a cell, e.g., fluorescently- 
labeled cDNA synthesized from total cell mRNA, to a microarray. A microarray is a surface 
with an ordered array of binding, e.g., hybridization, sites for products of many of the genes 
in the genome of a cell or organism, preferably most or almost all of the genes. Microarrays 
can be made in a number of ways, of which several are described below. However 
produced, microarrays share certain characteristics. The arrays are reproducible, allowing 
multiple copies of a given array to be produced and easily compared with each other. 
Preferably the microarrays are small, usually smaller than 5 cm 2 and they are made from 
materials that are stable under binding, e.g. nucleic acid hybridization, conditions. A given 
binding site or unique set of binding sites in the microarray will specifically bind the product of 
a smgle gene in the cell. Although there may be more than one physical binding site 
(hereinafter "site") per specific mRNA, for the sake of clarity the discussion below will assume 
that there is a single site. In a specific embodiment, positionally-addressable arrays 
containing affixed nucleic acids of known sequence at each location are used. 
[149] It will be appreciated that when cDNA complementary to the RNA of a cell is made 
and hybridized to a microarray under suitable hybridization conditions, the level of 
hybridization to the site in the array corresponding to any particular gene will reflect the 
prevalence in the cell of mRNA transcribed from that gene. For example, when detectably 
labeled, e.g., with a fluorophore, cDNA complementary to the total cellular mRNA is 
hybridized to a microarray, the site on the array corresponding to a gene, i.e., capable of 
specifically binding the product of the gene, that is not transcribed in the cell will have little or 
no signal, e.g., fluorescent signal, and a gene for which the encoded mRNA is prevalent will 
have a relatively strong signal. 
Preparat ion ofMicmarmy s 

[1 50] Microarrays are known in the art and consist of a surface to which probes that 
correspond in sequence to gene products, e.g., cDNAs, mRNAs, cRNAs, polypeptides and 
fragments thereof, can be specifically hybridized or bound at a known position In one 
embodiment, the microarray is an array, i.e., a matrix, in which each position represents a 
discrete binding site for a product encoded by a gene, e.g., a protein or RNA, and in which 
binding sites are present for products of most or almost all of the genes in the organism's 
genome. In a preferred embodiment, the "binding site", hereinafter "site", is a nucleic acid or 
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nucleic acid analogue to which a particular cognate cDNA can specifically hybridize. The 
nucleic acid or analogue of the binding site can be, e.g., a synthetic oligomer, a full-length 
cDNA, a less than full-length cDNA or a gene fragment. 

[1 51] Although in a preferred embodiment the microarray contains binding sites for 
products of all or almost all genes in the target organism's genome, such 
comprehensiveness is not necessarily required. Usually the microarray will have binding 
sites corresponding to at least about 50% of the genes in the genome, often at least about 
75%, more often at least about 85%, even more often more than about 90%, and most often 
at least about 99%. Preferably, the microarray has binding sites for genes relevant to testing 
and confirming a biological network model of interest. A "gene" is identified as an open 
reading frame (ORF) of preferably at least 50, 75 or 99 amino acids from which a mRNA is 
transcribed in the organism, e.g., if a single cell, or in some cell in a multicellular organism. 
The number of genes in a genome can be estimated from the number of mRNAs expressed 
by the organism, or by extrapolation from a well-characterized portion of the genome. When 
the genome of the organism of interest has been sequenced, the number of ORFs can be 
determined and mRNA coding regions identified by analysis of the DNA sequence. For 
example, the Saccharomyces cerevisiae genome has been completely sequenced and is 
reported to have approximately 6,275 ORFs longer than 99 amino acids. Analysis of these 
ORFs indicates that there are 5,885 ORFs that are likely to specify protein products. See 
Goffeau et al M Science, Vol. 274, pp. 546-567 (1996), which is incorporated by reference in 
its entirety for all purposes. In contrast, the human genome is estimated to contain 
approximately 10 s genes. 
Preparing Nucleic Acidst for Microarrays 

[152] As noted above, the "binding site" to which a particular cognate cDNA specifically 
hybridizes is usually a nucleic acid or nucleic acid analogue attached at that binding site. In 
one embodiment, the binding sites of the microarray are DNA polynucleotides corresponding 
to at least a portion of each gene in an organism's genome. These DNAs can be obtained 
by, e.g., PCR amplification of gene segments from genomic DNA, cDNA, e.g., by RT-PCR or 
cloned sequences. PCR primers are chosen, based on the known sequence of the genes'or 
cDNA, that result in amplification of unique fragments, i.e., fragments that do not share more 
than 1 0 bases of contiguous identical sequence with any other fragment on the microarray. 
Computer programs are useful in the design of primers with the required specificity and 
optimal amplification properties. See. e.g., Oligo pi version 5.0, National Biosciences. In the 
case of binding sites corresponding to very long genes, it will sometimes be desirable to 
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amplify segments near the 3' end of the gene so that when oligo-dT primed cDNA probes are 
hybridized to the microarray, less-than-full length probes will bind efficiently. Typically each 
gene fragment on the microarray will be between about 50 bp and about 2000 bp, more 
typically between about 100 bp and about 1000 bp, and usually between about 300 bp and 
about 800 bp in length. PCR methods are well-known and are described, e.g., in Innis et al., 
eds., PCR Protocols: A Guide to Methods and Applications, Academic Press Inc., San Diego, 
CA (1 990), which is incorporated by reference in its entirety for all purposes. It will be 
apparent that computer-controlled robotic systems are useful for isolating and amplifying 
nucleic acids. 

[1 53] An alternative means for generating the nucleic acid for the microarray is by 
synthesis of synthetic polynucleotides or oligonucleotides, e.g., using /V-phosphonate or 
phosphoramidite chemistries. See Froehler et al.. Nucleic Acid Res., Vol. 14, pp. 5399-5407 
(1986); and McBride et al., Tetrahedron Lett., Vol. 24, pp. 245-248 (1983). Synthetic 
sequences are between about 15 bases and about 500 bases in length, more typically 
between about 20 bases and about 50 bases. In some embodiments, synthetic nucleic acids 
include non-natural bases, e.g., inosine. As noted above, nucleic acid analogues may be 
used as binding sites for hybridization. An example of a suitable nucleic acid analogue is 
peptide nucleic acid. See, e.g., Egholm et al., Nature, Vol. 365, pp. 566-568 (1993); and also 
U.S. Patent No. 5,539,083. 

[1 54] In an alternative embodiment, the binding (hybridization) sites are made from 
plasmid or phage clones of genes, cDNAs, e.g., expressed sequence tags, or inserts 
therefrom. See Nguyen et al., Genomics, Vol. 29, pp. 207-209 (1995). In yet another 
embodiment, the polynucleotide of the binding sites is RNA. 
Attaching Nucleic Acids to the Solid Surface 

[155] The nucleic acid or analogue are attached to a solid support, which may be made 
from glass, plastic, e.g., polypropylene and nylon, polyacrylamide, nitrocellulose or other 
materials. A preferred method for attaching the nucleic acids to a surface is by printing on 
glass plates, as is described generally by Schena et al., Science, Vol. 270, pp. 467-470 
(1995). This method is especially useful for preparing microarrays of cDNA. See, also, 
DeRisi et al., Nat Genet, Vol. 14, pp. 457-460 (1996); Shalon et al., Genome Re's., Vol. 6, 
pp. 639-645 (1996); and Schena et al.. Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 10539-11286 
(1 995). Each of the aforementioned articles is incorporated by reference in its entirety for all 
purposes. 
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[1 56] A second preferred method for making microarrays is by making high-density 
oligonucleotide arrays. Techniques are known for producing arrays containing thousands of 
oligonucleotides complementary to defined sequences, at defined locations on a surface 
using photolithographic techniques for synthesis in situ [see Fodor et al.. Science, Vol. 251, 
pp. 767-773 (1991); Pease et al., Proc. Natl. Acad. Sci. USA, Vol. 91, No. 11, pp. 5022-5026 
(1994); Lockhart et al., Nat. Biotechnol., Vol. 14, p. 1675 (1996); and U.S. Patent 
Nos. 5,578,832; 5,556,752; and 5,510,270, each of which is incorporated by reference in its 
entirety for all purposes] or other methods for rapid synthesis and deposition of defined 
oligonucleotides [see Blanchard et al., Biosens. Bioelectron., Vol. 1 1, pp. 687-690 (1996)]. 
When these methods are used, oligonucleotides, e.g., 20 mers, of known sequence are 
synthesized directly on a surface such as a derivatized glass slide. Usually, the array 
produced is redundant, with several oligonucleotide molecules per RNA. Oligonucleotide 
probes can be chosen to detect alternatively spliced mRNAs. 

[157] Other methods for making microarrays, e.g., by masking, may also be used. See 
Maskos and Southern, Nucleic Acids Res., Vol. 20, pp. 1679-1684 (1992). In principal, any 
type of array, e.g.. dot blots on a nylon hybridization membrane [see Sambrook et al.. 
Molecular Cloning-A Laboratory Manual, 2 nd Edition, Vols. 1-3. Cold Spring Harbor 
Laboratory. Cold Spring Harbor. NY (1 989). which is incorporated in its entirety for all 
purposes], could be used, although, as will be recognized by those of skill in the art, very 
small arrays will be preferred because hybridization volumes will be smaller. 
Generating Labeled Pmhax 

[158] Methods for preparing total and poly(A) + RNA are well-known and are described 
generally in Sambrook et al. (1989), supra. In one embodiment. RNA is extracted from cells 
of the various types of interest in this invention using guanidinium thiocyanate lysis followed 
by CsCI centrifugation. See Chirgwin et al.. Biochemistry, Vol. 18. pp. 5294-5299 (1979). 
Poly(A) + RNA is selected by selection with oligo-dT cellulose. See Sambrook et al. (1989), 
supra. Cells of interest include wild-type cells, drug-exposed wild-type cells, cells with 
modified/perturbed cellular constituent(s). and drug-exposed cells with modified/perturbed 
cellular constituent(s). 

[159] Labeled cDNA is prepared from mRNA by oligo dT-primed or random-primed 
reverse transcription, both of which are well-known in the art. See. e.g.. Klug and Berger. 
Methods EnzymoL, Vol. 152. pp. 316-325 (1987). Reverse transcription may be carried out 
in the presence of a dNTP conjugated to a detectable label, most preferably a fluorescently- 
labeled dNTP. Alternatively, isolated mRNA can be converted to labeled antisense RNA 
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synthesized by in vitro transcription of double-stranded cDNA in the presence of labeled 
dNTPs. See Lockhart et ai. (1 996), supra, which is incorporated by reference in its entirety 
for all purposes. In alternative embodiments, the cDNA or RNA probe can be synthesized in 
the absence of detectable label and may be labeled subsequently, e.g., by incorporating 
biotinylated dNTPs or rNTP, or some similar means, e.g., photo-cross-linking a psoralen 
derivative of biotin to RNAs, followed by addition of labeled streptavidin, e.g., phycoerythrin- 
conjugated streptavidin or the equivalent. 

[160] When fluorescently-labeled probes are used, many suitable fluorophores are known, 
including fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, Cy3, 
Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others. See, e.g., Kricka, Nonisotopic DMA 
Probe Techniques, Academic Press, San Diego, CA (1992). It will be appreciated that pairs 
of fluorophores are chosen that have distinct emission spectra so that they can be easily 
distinguished. 

[161] In another embodiment, a label other than a fluorescent label is used. For example, 
a radioactive label, or a pair of radioactive labels with distinct emission spectra, can be used.' 
See Zhao et al., Gene, Vol. 156, p. 207 (1995); and Pietu et al., Genome Res., Vol. 6, p. 492 
(1996). However, because of scattering of radioactive particles, and the consequent 
requirement for widely-spaced binding sites, use of radioisotopes is a less-preferred 
embodiment. 

[162] In one embodiment, labeled cDNA is synthesized by incubating a mixture containing 
0.5 mM dGTP, dATP and dCTP plus 0.1 mM dTTP plus fluorescent deoxyribonucleotides, 
e.g„ 0.1 mM Rhodamine 1 10 UTP (Perken Elmer Cetus) or 0.1 mM Cy3 dUTP (Amersham), 
with reverse transcriptase, e.g., SuperScriptTM. II, LTI Inc., at 42°C for 60 minutes. 

Hybridizati on to Microarmy .t 

[1 63] Nucleic acid hybridization and wash conditions are chosen so that the probe 
"specifically binds" or "specifically hybridizes" to a specific array site, i.e., the probe 
hybridizes, duplexes or binds to a sequence array site with a complementary nucleic acid 
sequence but does not hybridize to a site with a non-complementary nucleic acid sequence. 
As used herein, one polynucleotide sequence is considered complementary to another when, 
if the shorter of the polynucleotides is S25 bases, there are no mismatches using standard 
base-pairing rules or, if the shorter of the polynucleotides is longer than 25 bases, there is no 
more than a 5% mismatch. Preferably, the polynucleotides are perfectly complementary (no 
mismatches). It can easily be demonstrated that specific hybridization conditions result in 
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specific hybridization by carrying out a hybridization assay including negative controls. See 
e.g., Shalon et al. (1996), supra; and Chee et al., supra. 

[164] Optimal hybridization conditions will depend on the length, e.g., oligomer vs 
polynucleotide >200 bases; and type. e.g.. RNA, DNA and PNA, of labeled probe and 
immobilized polynucleotide or oligonucleotide. General parameters for specific i e 
stringent, hybridization conditions for nucleic acids are described in Sambrook et al '(1996) 
supra; and Ausubel et al.. Current Protocols in Molecular Biology, Greene Publishing and ' 
Wiley-lnterscience. NY (1987), which is incorporated in its entirety for all purposes When 
the cDNA microarrays of Schena et al. are used, typical hybridization conditions are 
hybridization in 5 x SSC plus 0.2% SDS at 65°C for 4 hours followed by washes at 25'C in 
low-stringency wash buffer (1 x SSC plus 0.2% SDS) followed by 10 minutes at 25'C in high- 
stringency wash buffer (0.1 x SSC plus 0. 2 o/ 0 SDS). See Shena et al.. Proc. Natl. Acad Sci 
USA, Vol. 93. p. 10614 (1996). Useful hybridization conditions are also provided. See eg" 
Tijessen, Hybridization With Nucleic Acid Probes, Elsevier Science Publishers B V (1993)- *' 
and Kricka (1992). supra. ' ■ v ' wao; ' 

Signal Detection and n a ta Analy sis 

[165] When fluorescently-labeled probes are used, the fluorescence emissions at each 
s.te of a transcript array can be, preferably, detected by scanning confocal laser microscopy 
In one embodiment, a separate scan, using the appropriate excitation line, is carried out for 
each of the two fluorophores used. Alternatively, a laser can be used that allows 
simultaneous specimen illumination at wavelengths specific to the two fluorophores and 
emissions from the two fluorophores can be analyzed simultaneously. See Shalon et al 
(1996). supra, which is incorporated by reference in its entirety for all purposes In a 
preferred embodiment, the arrays are scanned with a laser fluorescent scanner with a 
computer-controlled X-Y stage and a microscope objective. Sequential excitation of the two 
fluorophores is achieved with a multi-line, mixed gas laser and the emitted light is split by 
wavelength and detected with two photomultiplier tubes. Fluorescence laser scanning 
devices are described in Schena et al. (1996). supra and in other references cited herein 
Alternatively, the fiber-optic bundle described by Ferguson et al.. Nat Biotechnol., Vol 14 
PP. 1681-1684 (1996). may be used to monitor mRNA abundance levels at a large number of 
sites simultaneously. 

[166] Signals are recorded and. in a preferred embodiment, analyzed by computer e g 
using a 12-bit analog to digital board. In one embodiment the scanned image is de-speckied 
us.ng a graphics program, e.g.. Hijaak Graphics Suite, and then analyzed using an image 
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gridding program that creates a spreadsheet of the average hybridization at each wavelength 
at each site. If necessary, an experimentally determined correction for "cross talk" (or 
overlap) between the channels for the two fluorophores may be made. For any particular 
hybridization site on the transcript array, a ratio of the emission of the two fluorophores is 
preferably calculated. The ratio is independent of the absolute expression level of the 
cognate gene, but is useful for genes whose expression is significantly modulated by drug 
administration, gene deletion or any other tested event. 

[167] Preferably, in addition to identifying a perturbation as positive or negative, it is 
advantageous to determine the magnitude of the perturbation. This can be carried out by 
methods that will be readily apparent to those of skill in the art. 
Other Methods of Trans criptional Stt^te Measurement 

[168] The transcriptional state of a cell may be measured by other gene expression 
technologies known in the art. Several such technologies produce pools of restriction 
fragments of limited complexity for electrophoretic analysis, such as methods combining 
double restriction enzyme digestion with phasing primers [see, e.g., EP 0 534858 A1 (1992) 
Zabeau et al.], or methods selecting restriction fragments with sites closest to a defined 
mRNA end [see. e.g., Prashar et al., Proc. Natl. Acad. Sci. USA, Vol. 93, pp. 659-663 
(1996)]. Other methods statistically sample cDNA pools, such as by sequencing sufficient 
bases, e.g., 20-50 bases, in each of multiple cDNAs to identify each cDNA, or by sequencing 
short tags, e.g., 9-10 bases, which are generated at known positions relative to a defined 
mRNA end pathway pattern. See, e.g., Velculescu, Science, Vol. 270, pp. 484-487 (1995) 
Measureme nt of Other Ax pant* 

[169] In various embodiments of the present invention, aspects of the biological state 
other than the transcriptional state, such as the translations state, the activity state or mixed 
aspects can be measured in order to obtain drug and pathway responses. Details of these 
embodiments are described in this section. 
Translatio nal State Measurements 

[170] Measurement of the translational state may be performed according to several 
methods. For example, whole genome monitoring of protein, i.e., the "proteome" [see 
Goffeau et al. (1996), supra], can be carried out by constructing a microarray in which 
binding sites comprise immobilized, preferably monoclonal, antibodies specific to a plurality 
of protein species encoded by the cell genome. Preferably, antibodies are present for a 
substantial fraction of the encoded proteins, or at least for those proteins relevant to testing 
or confirming a biological network model of interest. Methods for making monoclonal 
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an .bodies are well-known. See. e.g., Hariow and Lane, Antibodies: A Laboratory Manual 
Cold Spnng Harbor, NY (1988), whioh is inoorporated in its entirety for all purposes In a 

lelZZ TT enx ' monoc,ona, antibodles are raised agalnst ~ C ^ *~ 

des,gned based on genomic sequence of the cel.. With such an antibody array, proteins 
from the cell are contacted to the array and their binding is assayed with assays known in the 

1171) Alternatively, proteins can be separated by twodlmensional gel electrophoresis 
£ZL T r dim9nSi0nal 861 — is well-known In .he art and typically involves 
.sc-elecbrc focus,ng along a firs, dimension followed by SDS-PAGE electaphoresls along a 
second dimension. See, e.g., Hemes e. el.. Ge, Bectropnores* of Proteins: A Practical 
Approach, IRL Press, NY (1990); Shevchenko e. al.. Proc. Nat,. Acad SC. USA Vol 93 
PP. 1440-1445 (1996); Segllocco e. a,.. Yeas,. Vol. 12, pp. 1519-1533 (1996) . UndM ' 

numZ T T- 536 " 539 <1996) - reSUNin9 -" »e enalyzed by 

numerous techn,ques, .ncluding mass specbometric techniques, western bloMing and 

.mmunoblo. analysis using polyclonal and monoclonal antibodies, and internal and 

^terminal micro-sequencing. Using these techniques, I. is possible to identify a subs.an.lal 

fracon of a„ .he proteins produced under given physiological cond«ons, induding ,„ cel, 

SXP0Sed 10 3 ° f 061,8 TOdffled "* **• del6ti0n " * a 

Embodiments Based on 4 sna nfe nf „,„ p^^, CTaf a 

[172] Although monitoring cellular consents other man mRNA abundances currently 
presents certain lechnlca, oitafl. no. encountered in monitoring mRNAs, I, will be 
apparent te .hose of ski,, in me art mat me use of memods of alls invention ft* me acdvfttes 

It": *?" CharaCterizat,on - - - be measured, embodiments of 

th,s .nvenbon can be besed on such measurements. Acuity measurements can be 
perfonmed by any funcflohal. biochemical or physic, means appropriate to me particular 
ac*mybe,ng characterized. Where .be activity invohres a chemical transformed me 
cellular protein can be contacted with the natural substrates and the rate of transformation 

a"Zte d a ~ m ~ — E "an 

actvated DNA-bind,ng complex with DNA, the amount of associated protein or seondary 

consequences of the association, such as amounts of mRNA transcribed, can be measured 

Z^Z ^ *~* ' S kn ° Wn ' 6 ' 9 - 35 in 0611 <** ««■ P— 

«te funobon cen be observed. However known and measured, the changes in pretein 

•CM. form ma response date analyzed by the foregoing methods of this Invention 
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[173] In alternative and non-limiting embodiments, response data may be formed of mixed 
aspects of the biological state of a cell. Response data can be constructed from, e.g., 
changes in certain mRNA abundances, changes in certain protein abundances, and changes 
in certain protein activities. 
Computer Implementatinnx 

[1 74] In a preferred embodiment, the computation steps of the previous methods are 
implemented on a computer system or on one or more networked computer systems in order 
to provide a powerful and convenient facility for forming and testing models of biological 
systems. The computer system may be a single hardware platform comprising internal 
components and being linked to external components. The internal components of this 
computer system include processor element interconnected with a main memory. For 
example computer system can be an Intel Pentium based processor of 200 Mhz or greater 
clock rate and with 32 MB or more of main memory. 

[1 75] The external components include mass data storage. This mass storage can be 
one or more hard disks, which are typically packaged together with the processor and 
memory. Typically, such hard disks provide for at least 1 GB of storage. Other external 
components include user interface device, which can be a monitor and keyboards, together 
with pointing device, which can be a "mouse" or other graphic input devices. Typically the 
computer system is also linked to other local computer systems, remote computer systems, 
or wide area communication networks, such as the Internet. This network link allows the 
computer system to share data and processing tasks with other computer systems. 
[176] Loaded into memory during operation of this system are several software 
components, which are both standard in the art and special to the instant invention. These 
software components collectively cause the computer system to function according to the 
methods of this invention. These software components are typically stored on mass storage 
Alternatively, the software components may be stored on removable media such as floppy 
disks or CD-ROM (not illustrated). The software component represents the operating 
system, which is responsible for managing the computer system and its network 
interconnections. This operating system can be, e.g., of the Microsoft Windows family such 
as Windows 95, Windows 98 or Windows NT; or a Unix operating system, such as Sun 
Solaris. Software include common languages and functions conveniently present on this 
system to assist programs implementing the methods specific to this invention. Languages 
that can be used to program the analytic methods of this invention include C. C++ or, less 
preferably, JAVA. Most preferably, the methods of this invention are programmed in' 



WO 2005/005661 



PCT/EP2004/007167 



44 



mathematical software packages which allow symbolic entry of equations and high-level 
specification of processing, including algorithms to be used, thereby freeing a user of the 
need to procedurally program individual equations or algorithms. Such packages include 
e.g., Matlab from Mathworks (Natick, Mass.), Mathematica from Wolfram Research 
(Champaign, IL) and MathCAD from Mathsoft (Cambridge, MA). 
[177] In preferred embodiments, the analytic software component actually comprises 
separate software components which interact with each other. Analytic software represents 
a database containing all data necessary for the operation of the system. Such data will 
generally include, but is not necessarily limited to. results of prior experiments, genome data 
experimental procedures and cost and other information which will be apparent to those 
skilled In the art. Analytic software includes a data reduction and computation component 
comprising one or more programs which execute the analytic methods of the invention. 
[1 78] Analytic software also includes a user interface which provides a user of the 
computer system with control and input of test network models, and. optionally, experimental 
data. The user interface may comprise a drag-and-drop interface for specifying hypotheses 
to the system. The user interface may also comprise means for loading experimental data 
from the mass storage component, e.g., the hard drive; from removable media, e.g floppy 
disks or CD-ROM; or from a different computer system communicating with the instant 
system over a network, e.g., a local area network, or a wide area communication network 
such as the Internet. 

[179] Alternative systems and methods for implementing the analytic methods of this 
invention will be apparent to one of skill in the art and are intended to be comprehended 
wrthin the accompanying claims. In particular, the accompanying claims are intended to 
include the alternative program structures for implementing the methods of this invention that 
will be readily apparent to one of skill in the art. 
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Table 2. Quantification of Purified Total RNA and cRNA 
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Table 3. Summary of Experiment QC 
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Table 4. 



Comparison of Predicted Vs. Observed Status of 18 Ovarian Test Samples 
Witha 28 Probe Sets Expression Profile 



Biopsy status 
Obs. (Exp.) 



r £ 0.920 



r> 0.920 



Total 



1 (4.44) 



7 (3.56) 



9 (5.56) 



1 (4.44) 



10 



8 



18 



randomass^ Sh ° Wn for each W * —I--, with the va.ue expected under 

V = the PCC value of the 28 probe set profile of a biopsy sample with the mean Normal profile 
OR = 63 (95% CI: 3.3-1 1 94.7), p=0.0029 



Table 5. Comparison of Predicted Vs. Observed Status of 36 Ovarian Samples With a 
28 » 32 or 4 2 Probe Sets Expression Profile 



Number of Probe Sets 



Correlation threshold used 
OR 



28 



0.914 



32 



0.933 



42 



253 



392 



p-value (Fischer's exact test) 



14.4-4432.9 
2.3 x 10 -7 



14.8-10339.6 



1.0 x 10* 



216 
9.6-4882.5 



1.3 x 10" 7 
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Table 6. List of Genes Up-Regulated in Ovarian Tumors 

^rampleT^'r 8 ^ ^ eXPreSS '° n ,6Ve ' S *" 18 ^ SamP ' eS "* (R1) ° r h a " 



Probe Set 
Name 



Gene 
Symbol 



Description 



Cytogenetic 
Location 



R1 



R2 

0.636 



TOP2A 



Topoisomerase (DNA) II alpha 
(170kD) 



17q21-q22 



0.643 



C20ORF1 
ARL7 



Chromosome 20 ORF 1 



20q11.2 



ADP-ribosylation factor-like 7 



2q37.2 



37985_at 
2092 s at 



LMNB1 



Lamin B1 



5q23.3-q31.1 



0.608 
0.679 



SPP1 



Secreted phosphoprotein 1 
(osteopontin) 



38116 at 



KIAA0101 



KIAA0101 gene product 



4q21-q25 



0.560 



15q22.1 



0.556 



0.595 




Table 7. Functional Categories of the Most Differentially-Expressed Genes in 
Ovarian Cancer 



Cate gories 
Cell cycle regulation 
Growth differentiation and cell death 
Tumor suppression 



Top 28 Probe Sets 
8 



Top 10 0 Prob e Sets* 
13* 



Transcription regulation 



Metabolic enzymes 



Cytoskeletal proteins 



8 



27 



,GFBP5 ' ,RS1 ' ,TPR1 RBPMS) Simi,ar results were obtained with 2 
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Table 8 List of Genes Down-Regulated In Ovarian Tumors Functional Category 



Probe Set 
Name 



Gene 
Symbol 



Description 



Cytogenetic 
Location 



Cell cycle regulat ion, growth differentiation, cell death, t umor sudors Jn 



R1 



R2 



38120_at 
34257_at 
38650_at 
1396_at 
1897 at 



36073_at 



36160_s_at 
37643_at 



1640_at 



35234_at 



1731 at 



1761_at 



1327_s_at 

39701_at 
36948_at 



32668_at 



PKD2 
AIP1 
IGFBP5 
IGFBP5 
TGFpR3 



NON 



PTPRN2 
TNFRSF6 



ST13 



RECK 



Polycystic kidney 
disease 2 

Atrophin-1 interacting 
protein 1 

Insulin-like growth factor 
binding protein 5 

Insulin-like growth factor 
binding protein 5 

Transforming growth 
factor, beta receptor HI 



Necdin homolog 



4q21-q23 

7q21 
2q33-34 
2q33-q36 
1p33-p32 



Protein tyrosine 
phosphatase, receptor 
type, N polypeptide 2 

Tumor necrosis factor 
receptor superfamily, 
member 6 



Suppression of 
tumorigenicity 13 



PDGFRa 



PDGFRL 



MAP3K5 

PEG3 
CRM 



SSBP2 



Transcriptional regulation 



Reversion-inducing- 
cysteine-rich protein with 
kazal motifs 



Platelet-derived growth 
factor receptor, alpha 
polypeptide 



Platelet-derived growth 
factor receptor-like 



Mitogen-activated protein 
kinase kinase kinas e 5 

Paternally expressed 3 

CREBBP/EP300 inhibitory 
protein 1 



Single-stranded DNA 
binding protein 2 



15q11.2-q12 



7q36 
10q24.1 



22q13.2 



9p13-p12 



4q11-q13 



8p22-p21.3 



6q22.33 

19q13.4 
15q21.1-q21.2~ 



1577 at 



32664_at 
38439_at 



AR 



RNASE4 
NFE2L1 



Androgen receptor 



Ribonuclease, RNase A 
family, 4 

Nuclear factor (erythroid- 
derived 2)-like 1 



5q14.1 
Xq11.2-q12~ 



0.852 
0.846 
0.840 
0.821 
0.809 



0.804 



0.788 



0.779 



0.769 



0.741 



0.739 



0.736 



0.729 

0.724 
0.703 



0.768 
0.786 
0.748 
0.756 
0.770 



0.774 



0.846 



0.716 



0.734 



0.634 



0.619 



0.789 



14q11.1 
17q21.3 



0.503 



0.736 



0.626 

0.611 
0.766 



0.848 
0.829 



0.809 



0.767 



Rank 



0.749 



0.736 



25 
IF 



23 



21 



10 



27* 



6* 
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Probe Set 
Name 


Gene 
Symbol 


Description 


Cytogenetic 
Location 


R1 


R2 


rcanK 


38047_at 


RBPMS 


RNA-binding protein gene 
with multiple splicing 


8p12-p11 


0 822 


n 7«^ft 

yj. i oo 




40775_at 


ITM2A 


Integral membrane 
protein 2A 


Xa13.3-Xo21 2 


0 ftlft 

w-O 1 O 


U.OOO 




34163_g_at 


RBPMS 


RNA-binding protein gene 
with multiple splicing 


8p12-p11 


0.776 


0.709 




40570_at 


FOX01A 


Forkhead box OIA 
(rhabdomyosarcoma) 


13q14.1 


0.775 


0.668 




35681_r_at 


2FHX1B 


Zinc finger homeobox 1b 


2q22 


0.769 


0.644 




40202_at 


BTEB1 


oasic transcription 
element binding protein 1 


9q13 


0.754 


0.606 




41505_r_at 


MAF 


v-maf musculoaponeurotic 
fibrosarcoma oncogene 
homolog 


16a22-o23 


n 


n cod 
U.OOO 




34355_at 


MECP2 


Methyl CpG binding 
protein 2 (Rett syndrome) 


Xq28 


0.728 


0.662 




32259_at 


EZH1 


Enhancer of zeste 
homolog 1 


17q21.1-q21.3 


0.728 


0.611 




41000_at 


CHES1 


Checkpoint suppressor 1 


14q24.3-q31 


0.727 


0.677 




34740_at 


FOX03A 


Forkhead box 03A 


6q21 




U.520 




39243_s_at 


PSIP2 


PC4 and SFRS1 
interacting protein 2 


9p22.1 


0.721 


0.702 




Signal transdiu 


stion 










755_at 


ITPR1 

If 1 l \ I 


Inositol 1,4,5-triphosphate 
receptor, type 1 


3p26-p25 


0.809 


0.822 


4 


38176_at 




Guanine nucleotide 
binding protein (G protein), 
beta 5 


15q15.3 


0.805 


0.729 




39397_at 


NR2F2 


Nuclear receptor subfamily 
2, group F, member 2 


15q26 


0.768 


0.700 




872_i_at 


IRS1 


Insulin receptor 
substrate 1 


2q36 


0.756 


0.770 


24 


40994_at 


GPRK5 


G protein-coupled receptor 
kinase 5 


10q24-qter 


0.749 


0.627 




37908_at 


GNG11 


Guanine nucleotide 
binding protein 1 1 


7q31-q32 


0.748 


0.557 




41049_at 


IRS1 


Insulin receptor 
substrate 1 


2q36 


0.744 


0.694 




32778_at 


ITPR1 


Inositol 1 ,4,5-triphosphate 
receptor, type 1 


3p26-p25 


0.743 


0.689 " 




1135_at 


GPRK5 


G protein-coupled receptor 
kinase 5 


10q24-qter 


0.732 


0.652 




34877_at 


JAK1 


Janus kinase 1 


1p32.3-p31.3 


0.720 


0.671 
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Probe Set 

IIHl 1 1 %S 


Gene 

oymDOl 


Description 


Cytogenetic 
Location 


R1 


R2 


Rank 


41796_at 


PLCL2 


Phospholipase C-like 2 


3p24.3 


0.689 


0.779 


17* 


Metabolic enz; 


/mes 










32764_at 


PHYH 


Phytanoyl-CoA 
hydroxylase (Refsum 
disease) 


10pter-p11.2 


ft ono 


0.691 




37628_at 


MAOB 


Monoamine oxidase B 


| Xp11.4-p11.3 


0.865 


0.808 


7 


41859_at 


UST 


Uronyi-2-sulfotransferase 


6q24.3-q25.1 


0.865 


0.877 


1 


38220_at 


I DPYD 


Dihydropyrimidine 
dehydrogenase 


1p22 




ft QO/*> 


5 


3701 5_at 


ALDH1A1 


Aldehyde dehydrogenase 
1 family, member A1 


~ 9q21 


0.820 


0.751 




1290_g_at 


GSTM5 


Glutathione S-transferase 
M5 


1p13.3 


0.814 


0.776 


19 


o/ oyy ax 


AOX1 


Aldehyde oxidase 1 


2q33 


0.743 


0.652 




32805_at 


AKR1C1 


Aldo-keto reductase 
family 1, member C1 


10p15-p14 


0.736 


0.586 




34169_s_at 


OCRL 


Oculocerebrorenal 
syndrome of Lowe 


Xq25-q26.1 


0.722 


0.661 




32618_at 


BLVRA 


Biliverdin reductase A 


7p14-cen 


0.430 


0.786 


11* 


Cytoskeleton 












32145_at 


ADD1 


Adducin 1 (alpha) 


4p16.3 


0.817 


0.786 


12 


40488_at 


DMD 


Dystrophin (muscular 
dystrophy, Duchenne and 
Becker types) 


Xp21.2 


n 770 

M.I Jo 


0.698 




38669_at 


SLK 


Ste20-related serine/ 
threonine kinase 


10q25.1 


0.770 


0.712 




41738_at 


CALD1 


Caldesmon 1 


7q33 


0.755 


0.707 




34772_at 


COR02B 


Coronin, actin binding 
protein, 2B 


15q22.2-q22.31 


0.729 


0.748 




Extracellular m 


atrix 










39673_i_at 


ECM3 


Extracellular matrix 
protein 3 


9q22.3 


0.827 


0.698 




39674_r_at 


ECM2 


Extracellular matrix 
protein 2 


9q22.3 


0.811 


0.699 




oo91 7_at 


LAMa2 


Laminin, alpha 2 


6q22-q23 


0.810 


0.752 




41449_at 


SGCE 


Sarcoglycan, epsilon 


7q21-q22 


0.778 


0.762 


28 


36627_at 


SPARCL1 


SPARC-like 1 (mast9, 
hevin) 


4q21.3 


0.770 " 


0.675 " 
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Probe Set 
Name 

32535_at 

Others 
3571 7_at 



37394_at 



40767_at 
41137_at 

38122_at 



32526_at 
38119_at 
38634_at 
32109_at 
40496_at 
41138 at 



40786_at 



35354_at 
36873 at 



Gene 
Symbo l 

FBN1 



ABCA8 



C7 



TFPI 
PPP1R12B 

SLC23A1 



JAM3 
GYPC 
RBP1 
FXYD1 
C1S 
MIC2 



PPP2R5C 



RPL3 
VLDLR 



Description 

Fibrillin 1 (Marfan 
syndrome) 

ATP-blnding cassette, 
subfamily A (ABC1), 
member 8 

Complement component 7 

Tissue factor pathway 
inhibitor 

Protein phosphatase 1 , 
regulatory (inhibitor) 
subunit 12B 

Solute carrier family 23 
(nucleobase transporters), 
member 1 



Junctional adhesion 
molecule 3 



Glycophorin C (Gerbich 
blood group) 

Retinol binding protein 1, 
cellular 

FXYD domain containing 
ion transport regula tor 1 

Complement component 
1, s subcomponent 

Antigen identified by 
monoclonal antibodies 
12E7, F21 and 013 



Protein phosphatase 2, 
regulatory subunit B (B56), 
gamma isoform 

Ribosomal protein L3 

Very low density 
ipoprotein receptor 



Cytogenetic 
Location 


R1 


R2 


Rank | 


15q21.1 


0.729 


0.582 













5p13 



2q31-q32.1 
1q32.1 

20p13 



11q25 



2q14-q21 

3q23 

19q13.1 

12p13 

Xp22.32; 
Yp11.3 



3p21 



22q13 
9p24 



0.791 



0.783 
0.656 



0.779 
0.768 

0.761 



0.749 



0.747 
0.745 
0.744 
0.734 
0.731 



0.725 



0.723 
0.722 



0.640 
0.631 

0.707 



0.631 



0.669 
0.677 
0.670 
0.617 
0.565 



0.705 



0.667 
0.633 



3871 7_at 
36867_at 



DKFZP586 
A0522 

Unknown 



DKFZP586A0522 protein 
LOC92710 



16 




12q11 
1q31.1 



0.793 
0.785 



0.789 



0.739 



9 
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Probe Set 
Name 

39852 at 



40063 at 



41685 at 



37446_at 
36894 at 



41273_at 
40861 at 



35164 at 



39400 at 



38113 at 



34760_at 
33690 at 



41478_at 
32076 at 



40853 at 



39714 at 



36577 at 



38643 at 



38968_at 
37743 at 



32251 at 



Gene 
Symbol 

TAHCCP1 
NDP52 



KIAA0752 



KIAA0443 
Unknown 



Unknown 
MRGX 



WFS1 



KIAA1055 
SYNE-1 

KIAA0022 



Unknown 
KIAA1043 
DSCR1L1 



ATP10D 



SH3BGRL 



MIG2 



Unknown 
SH3BP5 

FEZ1 



Description 

Transactivated by hepatitis 
C viais core protein 1 

Nuclear domain 10 protein 



KIAA0752 protein 



MORF-related gene X 



Wolfram syndrome 1 
(wolframin) 



Synaptic nuclei expressed 
gene 1 

KIAA0022 gene product 



Down syndrome critical 
region gene 1-like 1 



ATPase.Class V, type 10D 



SH3 domain binding 
glutamic acid-rich protein 
like 



Mitogen inducible 2 



FLJ21174 



39743_at | FLJ20580 



36396_at Unknown 



35173_at | DXS1283E 



38394_at | K1AA0089 



4091 6_at | FLJ10097 



SH3-domain binding 
protein 5 (BTK-associated) 
Fasciculation and 
elongation protein zeta 1 
(zygin I) 



Hypothetical protein 
FLJ21174 



Cytogenetic 
Location 

13q13.1 
17q23.2 



5q35.3 



Xq22.1 
22q12.3-13.1 



Xq22 



4p16 



15q24.1 
6q25 

2q24.2 



22q12.1 
6p12.3 



4p12 



Xq13.3 



14q22.1 



3p24.3 
11q24.2 



Xq22.1 



1p33 



Xp22.3 



3p22.2 



Xq22.1-q22.3 



R1 

0.783 

0.777 



0.777 



0.771 
0.768 



0.766 
0.760 



0.758 



0.758 
0.754 



0.749 



0.747 
0.745 
0.741 



0.740 



0.738 



0.736 



0.735 
0.729 

0.729 



0.728 



0.702 



0.688 



0.617 



0.500 



0.359 



R2 

0.738 

0.729 



0.871 



0.613 
0.634 



0.758 
0.689 



0.621 



0.634 
0.681 

0.699 



0.565 
0.653 
0.713 



0.647 



0.743 



0.549 



0.714 
0.683 

0.617 



0.615 



0.775 



0.785 



0.774 



0.790 



0.777 



20* 



15* 



22* 



8* 



•Indicates genes from the 28 classification set not ranked within the 100 highest R1 values. 



18* 
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Table 9. 



_2_ 
3 



j4_ 
5 



Full Set of 900 Genes Differentially Affected in Ovarian Cancer 



Probe Sets 

41859_at 
41685_at 
36160 s at 



755_at 
38220_at 
32668 at 



CC 

0.877 
0.871 
0.846 



0.822 
0.82 
0.809 



Gene Symbol 
UST 
KIAA0752 
PTPRN2 



ITPR1 
DP YD 
SSBP2 



Cytogenetic 
Location 

6q24.3-q25 .1 

5q35.3 

7q36 



3p26-p25 
1p22 
5q14.1 



8 



9 
10 



37628 at 



38394 at 



3871 7_at 
1761 at 



0.808 
0.79 



MAOB 



KIAA0089 



0.789 
0.789 



DKFZP586A0522 



Xp11.4-p11.3 



3p22.2 



12q11 



PDGFRL 



8p22-p21.3 



12 
13 



32618 at 



32145 at 



0.786 



BLVRA 



0.786 



ADD1 



7p14-cen 



7p16.3 



14 



15 
16 



17 



18 



19 



20 



21 
22 
23 



34257_at 
40423 at 



36396_at 
35717 at 



41796 at 



40916 at 



1290_g_at 



39743 at 



36073_at 
35173_at 
1897 at 



40480 s at 



0.786 



AIP1 



0.785 
0.785 



KIAA0903 



0.783 



cDNA DKFZP586N 



ABCA8 



0.779 



PLCL2 



0.777 



Gene for hypothetical protein 
FLJ 10097 



0.776 
0.775 



GSTM5 



FLJ20580 



0.774 
0.774 
0.77 



NDN 
DXS1283E " 
TGFBR3 



7q21 



2p13.3 



4p16.3 



17q24 



3p24.3 



1p13.3 



1p33 



15q11.2-q12 
Xp22.3 
1p33-p32 



15q21.1-q21.2 
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84 



96 
97 



98 
99 



Probe Sets 



35742 at 




41147 at 



34760 at 



39674 r at 



39401 at 



CC 



0.708 



0.699 



0.699 



0.699 



0.699 



Gene Symbol 



LKAP 



MGC4276 



KIAA0022 



ECM2 



IMAGE clone 3460701 



Cytogenetic 
Location 



9q22.1 



2q24.2 



9q22.3 



100 
101 
102 



40488 at 



39673_l_at 



0.698 



DMD 



0.698 



ECM2 



Xp21.2 



9q22.3 



103 



104 
105 
106 



39864_at 
1127 at 



40674_s_at 
36975 at 



41049_at 
32593 at 



0.698 
0.697 



CIRBP 



RPS6KA1 



0.697 



HOXC6 



0.696 



MGC8721 



0.694 
0.692 



IRS1 



19p13.3 



12q13.3 



8p12 



107 



108 



109 
110 



1629 s at 



36620 at 



32764 at 



KIAA0084 



0.692 



PTPN13 



0.692 



SOD1 



0.691 



PGCP 



3p24.3 



4q21.3 



21q22.11 



8q22.2 



111 



112 



113 



114 
115 



116 



117 
118 



119 
120 



35785 at 



39681 at 



39438 at 



32778 at 



40861 at 



34990 at 



1736 at 



41771_g_at 



31852 at 



36542 at 



0.691 



GABARAPL1 



0.69 



ZNF145 



0.69 



CREBL2 



0.689 



ITPR1 



0.689 



MRGX 



0.688 



0.688 



SETBP1 



IGFBP6 



0.687 



0.687 



MAOA 



DKFZP564O043 



0.686 
0.684 



SLC9A6 



12p13.1 



11q23.1 



12p13 



3p26-p25 



Xq22 



18q21.1 



12q13 



Xp11.4-p11.3 



7p21 



Xq26.3 



121 



122 



123 



124 



125 
126 



37379_at 
38968 at 



39691 at 



38211 at 



38113 at 



40601_at 
2092 s at 



SF3A3 



0.683 



SH3BP5 



0.683 



SH3GLB1 



0.681 



ZNF288 



0.681 



SYNE-1 



0.68 
0.679 



BBP 
SPP1 



1p35.2 



3p24.3 



1p22 



3q13.2 



6q25 



1p32.1 
4q21-q25 
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132 
133 



134 
135 



136 



137 



138 
139 



142 



143 



144 



145 



147 



148 



149 



152 



153 



154 



155 



156 



157 



158 



159 



160 



163 



164 



165 



166 



Probe Sets 



33830_at 
34637_f_at 



38634_at 
35643_at 
41000 at 



37828_at 
35163 at 



36627_at 
853_at 



34356_ at 



3801 3_at 



38664_at 
32087_at 
38768_at 
34877_at 



1090_f_at 



32109_at 



34859 at 

— — 



38119_at 



146 31510_s_at 



1058_at 



40570 at 



39091_at 



32057_at 
31993_f_at 



35354at 



40775_at 



40140 at 



37406 at 



38685 at 



34363_at 



33351 at 



41655_at 



39072 at 



36544_at 
32542 at 



35767_at 



34355 at 



1578_g at 



41656_at 



167 34169 s at 

168 41662. at 

169 I _40203at 



CC 

0.679 
0.678 
0.677 
0.677 
0.677 



0.677 
0.676 
0.675 
0.674 
0.673 



0.672 



0.672 
0.672 
0.671 
0.671 



0.671 



0.67 



0.669 



0.669 



0.668 



0.668 



0.668 



0.668 



0.667 
0.667 



0.667 



0.666 
0.666 



0.665 



0.665 



0.665 



0.664 



0.664 



0.662 



0.662 
0.662 



0.662 



0.662 



0.661 



0.661 



0.661 
0.661 
0.66 



Gene Symbol 
HSOBRGR P 
ADH1A 
RBP1 
NUCB2 
CHES1 



FLJ11220 
KIAA1041 
SPARCL1 
NFE2L2 
SURB7 



ATIP1 



CFDP1 
HSF2 
HADHS C 
JAK1 



FXYD1 



MAGED2 



GYPC 



H3F3B 



WASF3 



FOXQ1A 



JWA 



P37NB 
EST 



SYNGR1 



ITM2A 
ZFP103 



MAPRE2 



STX12 



SEPP1 



GC20 



MID2 



MXI1 



clone 
FHL1 



GABARAPL2 



MECP2 



AR 



NMT2 



OCRL 
DKFZP566B183 
SUI1 



14q24.3-q31 



1p11.2 



2q31 
12p12.3 



8p22 



16q22.2-q22.3 



19q13.1 



Xp11.4-p11.1 



2q14-q21 



17q25 



13q12 



13q14.1 



3p14 



22q13.1 



18q12.1 



1p35-34.1 



5q31 



3p21.33 



Xq22 



1Qq24-q25 



16q22.3-q24.1 



Xq28 



Xq11.2-q12 



10p12.33-p12.32 
Xq25-q26.1 
12p13.32 
17 
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170 



171 



172 
173 



174 
175 
176 
177 
178 



179 
180 



181 



182 



183 



184 
185 
186 



187 



188 



189 
190 
191 



192 



193 



194 
195 



196 



197 
198 
199 



200 



201 



202 



203 



204 



205 
206 



207 



208 



209 



210 
211 
212 



Probe Sets 



3931 5_at 



226 at 



33878_at 
38408_at 



35704_at 
37394_at 
39829_at 
40770_f_at 
35846 at 



176 at 



36690_at 
39351 at 



41478 at 



950 at 



35359_at 
1135_at 
36091 at 



2003 s at 



35246 at 



40576_f_at 
37599_at 
35209 at 



38916 at 



33126 at 



37706 at 



40077 at 



37294 at 



32597_at 
32768_at 
36543 at 



37736 at 



40853 at 



41830 at 



37197_s at 



37205 at 



4061 7_at 
35681 r at 



41594 at 



192 at 



654 at 



41742_s_ at 
32676_at 
35752 s at 



CC 

0.659 



0.657 
0.657 



0.656 



0.656 
0.656 
0.656 
0.655 
0.655 



0.654 



0.654 
0.653 



0.653 



0.653 



0.653 
0.652 
0.652 



0.652 



0.652 



0.652 
0.652 
0.652 



0.651 



0.651 



0.65 



0.65 



0.649 



0.648 
0.648 
0.648 



0.647 



0.647 



0.647 



0.647 



0.645 



0.645 
0.644 



0.644 



0.643 



0.643 
0.642 



0.642 



0.642 



Gene Symbol 



ANGPT1 



PRKAR1A 



FLJ13612 
TM4SF2 



HRASLS3 
C7 
ARL7 
HNRPDL 
THRA 



PPP2R5C 



NR3C1 
CD59 
KIAA1043 



TLOC1 



PUM2 
GPRK5 
SCAP2 



MSH6 



TYRQ3 



HNRPDL 

AOX1 
KIAA0766 



CXorf6 



AD-017 



GLG1 



ACQ1 



BTG1 



RBL2 
FU21007 
F3 



PCMT1 



ATP10D 



KIAA0494 



DKFZP564A033 



FBXL7 



SAH 
ZFHX1B 



JAK1 



TAF7 



MXI1 



OPTN 
ALDH6A1 



PROS1 



Cytogenetic 
Location 



8q22.3-q23 



17q23-q24 



2q36.1 
Xq11.4 



11q13.1 
5p13 
2q37.2 
4q13-q21 
17q11.2 



3p21 



5q31 
11p13 



22q12.1 



3q26.2-q27 



2p22-p21 

10q24-qter 

7p21-p15 



2p16 



15q15.1-q21.1 



4q13-q2 1 
2q33 
3p22.1 



Xq28 



3p21.31 



16q22-q23 



9p22-p13 



12q22 



16q12.2 
13q21.1 
1p22-p21 



6q24-q25 



4p12 



1pter-p22.1 



5p15.1 



16p13.H 
2q22 



1p32.3-p31.3 



5q31 



1Qq24-q25 



10p12.33 
14q24.3 



3p11-q11.2 
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Probe Sets 
34774 at 



CC 



Gene Symbol 



Cytogenetic 
Location 



0.642 
0.642 



PPT1 



1p32 
6p21.1 



38892_at 



KIAA0240 



216 



224 



230 



231 



232 



233 
234 
235 



236 



659_g_at 



41288 at 



1678_g_at 
41505 r 



at 



2086_s at 



1147 at 



40145 at 



40211 at 



37617 at 



36489_at 
36488 at 



33443 at 



0.641 



0.641 



0.638 



0.638 



0.637 



0.637 



0.636 



0.636 



0.636 



0.636 



0.636 



0.635 



THBS2 



CALM1 



IGFBP5 



MAF 



TYR03 



NR2F1 



TOP2A 



HNRPA1 



KIAA1 128 



PRPS1 



EGFL5 



TDE1L 



6q27 




2q33-q36 




15q15.1-q21.1 



5q14 



17q21-q22 



12q13.1 



10q23.31 



Xq21-q27 



9q32-q33.3 



6q22.32 
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PCT/EP2004/007167 



306 



307 



308 



309 



310 



311 



312 



315 



316 



317 
318 



321 



324 



327 



328 



329 



330 



331 



332 



333 



334 



335 



336 



337 



338 



339 
340 
341 



Probe Sets 



1211 s at 



34819__at 
34808_at 
37446_at 
32792_at 
36650 at 



38438 at 



39701 at 



34215 at 



32259 at 



40839 at 



39055 at 



40508 at 



37985 at 



33799_at 
37638 at 



33140 at 



40202 at 



39033 at 



34789 at 



3381 7_at 
1719 at 



38923 at 



41338_at 
35751 at 



1377 at 



33123_at 
933 f at 



32696_at 



1323 at 



34349_at 



37731_at 



37315 f at 



36695_at 



31867_at 



31944_at 



1070_at 



38254__at 



3771 0_at 



33343 at 



32779_s at 



33865_at 
31508_at 



CC 



0.614 



0.614 
0.614 
0.613 
0.612 
0.612 



0.612 



0.611 



0.611 
0.611 



0.611 
0.61 



0.61 
0.608 



0.607 
0.607 



0.607 



0.606 



0.606 



0.606 



0.606 
0.606 



0.605 



0.605 
0.605 



0.605 



0.605 
0.604 



0.604 



0.604 



0.603 



0.602 



0.602 



0.602 



Gene Symbol 



0.602 



0.601 



0.601 



0.601 



0.6 



0.6 



0.599 



0.599 
0.599 



CRADD 



CD 164 
KIAA0999 
KI AA044 3 

P29 
CCND2 



NFKB1 



PEG3 



DXYS155E 



EZH1 



UBL3 



SRI 



GSTA4 



LMNB1 



SIAH2 
DOCK1 



B3GNT6 



BTEB1 



C1orf8 



SERPINB6 



HNRPA3 
MSH3 



FRG1 



AES 
SDHB 



NFKB1 



HRIHFB2206 
2NF91 



PBX3 



UBB 



SEC63L 



EPS15 



C14orf11 



cDNA FU40364 fis 



TULP3 



GTF2B 



KIAA0882 



MEF2C 



RNF14 



ITPR1 



BS69 
TXNIP 



Cytogenetic 
Location 



12q21.33-q23.1 



12p13 



4q24 



19q13.4 



Xp22.32 



17q21.1-q21.3 



13q12-q13 



7q21.1 



6p12.1 



5q23.3-q31.1 



3q25 



1Qq26.13-q26.3 
11q12.1 



9q13 



1p36-p31 



6p25 



10q11.1 
5q1 1-q12 



4q35 



19p13.3 
1p36.1-p35 



4q24 



16q22.1 
19p13.1-p12 



9q33-q34 



17p12-p11.2 



6q21 



1p32 



14q12 



3q13.12 



12p13.3 



1p22-p21 



4q31.1 



5q14 



5q23.3-q31.1 



3p26-p25 



10p14 
1q12 
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Probe Sets 


CC 


Gene Symbol 


u y iwgeneuc 
Location 


385 


41424_at 


E 0.588 


PON3 


7q21.3 


386 


32667 at 


0.588 


COL4A5 


Xq22 


387 


39979 at 


0.588 


l F10 


13q34 


388 


34753_at 


0.587 


SYBL1 


Xq28 


389 


39082_at 


0.587 


ANXA6 


5q32-q34 


390 


39989_at 


0.587 


RAGB 


Xp11.21 


391 


33235_at 


0.587 


NAV3 




392 


36825 at 


0.587 


TRIM22 


11p15 


393 


41747 s at 


0.587 


MEF2A 


F 1 5q26 


394 


38279 at 


0.587 


GNAZ 


22a 11 22 


395 


3251 1_at 


0.586 


cDNA FLJ37094 fis 




396 


32805 at 


0.586 


AKR1C 


10d15-d14 


397 


1529_at 


0.586 


13CDNA73 


13q12.3 


398 


34570_at 


0.586 


RPS27A 


2d16 


399 


31932 f at 


0.586 


BTF3 


5q13.1 


400 


35055 at 


^ 0.586 


BTF3 


5q13.3 


401 


32822 at 


0.585 


SLC25A4 


4q35 


402 


37131 at 


0.585 


KLK8 


19a13 3-a13 4 


403 


3531 8_at 


0.585 


KIAA0475 


1 1p36.13-a41 


404 


36526_at 


0.584 


EXTL2 


1p21 


405 


38837 at 


0.584 


DJ971N18.2 


20d12 


406 


36492_at 


0.583 


PSMD9 


12a24 31-a24 *V> 


407 


3651 5_at 


0.583 


GNE 


9p11.2 


408 


35737_at 


0.583 


HMGN4 


6p21 .3 


409 


32535 at 


0.582 


FBN1 


15q21.1 


410 


39838 at 


0.582 


CLASP1 


2a21 3 


411 


1307_at 


0.582 


XPA 


9a22 3 


412 


40971 at 


0.581 


KIAA0229 


6d21 2 


413 


1319_at 


0.58 


DDR2 


1a12-a23 


414 


33892 at 


0.579 


PKP2 


12n11 


415 


33800 at 


0.579 


ADCY9 


16d13 3 


416 


39790 at 


0.579 


ATP2A2 


12a23-^i24 1 


417 


37725 at 


0.578 


PPP1CC 


12a24 1-n94 O 


418 


38711 at 


0.578 


CLASP2 


3d22 2-d22 1 


419 


32662 at 


0.578 


KIAA0170 


6pter-p21.31 


Aon 


o^oo£ at 


0.578 


MYH11 


16p13.13-p13.12 


421 


41013 at 


0.578 


cDNA DKFZp586M2022 




422 


32743 at 


0.577 


KIAA0453 


1p36.31-p36.11 


423 


35325 at 


0.577 


RAB14 


9q32-q34.11 


424 


36626_at 


0.577 


HSD17B4 


5q21 


425 


39038 at 


0.576 


FBLN5 1 


14q32.1 


426 


32160 at 


0.576 


SIAH1 


16q12 


427 


1501 at 


0.576 


IGF1 


12q22-q23 



WO 2005/005661 



PCT/EP2004/007167 



64 





Probe Sets 


CC 


Gene Symbol 


vy iu g tsi i e uw 

Location 


428 


32244 at 


0.576 


KIAA0737 


14q11.1 1 


429 


450_g_at 


0.576 


CGR19 


I 14q22.1 


430 


39083 at 


0.575 


UBE2D3 


4q22.2 


431 


36596 r at 


0.575 


GATM 


15q14 


I 432 


35276_at 


! 0.575 


CLDN4 


! 7q11.23 


433 


'36578_at 


0.575 


BIRC2 


1 1q22 


434 


32153 s at 


i 0.575 


? 




435 


33847 s at 


0.574 


CDKN1B 


! 12p13.1-n12 


! 436 


40432_at 


0.573 


GNS 


12q14 ! 


437 


39346 at 


0.573 


KHDRBS1 


1d32 


438 


38581_at 


0.573 


GNAQ 


9q21 


439 


37604 at 


0.573 


HNMT 


2q21.3 


440 


41691_at 


0.572 


KIAA0794 


3q29 


441 


201_s at 


0.571 


B2M 


15a21-a22 2 


442 


41600_at 


0.571 


PA2G4 


12q13 


443 


39762_at 


0.571 


ZNF262 


1 D32-D34 


444 


32099_at 


0.57 


KIAA0138 


19p13.3 


445 


41701_at 


0.57 


C6 


5d13 


446 


39150_at 


0.569 


RNF11 


1 oter-D22 1 


447 


36474 at 


0.569 


KIAA0776 


6q16.3 


! 448 


39685_at 


0.568 


E46L 


22a13 31 


449 


37391 at 


0.567 


CTSL 


9a21-a22 


450 


35843_at 


0.567 


NEK9 


14a24 2 


451 


41136_s_at 


0.567 


APP 


21a21 ^ 


452 


35203_at 


0.566 


MORF 


10q22 2 


453 


34162 at 


0.566 


RBPMS 


8Dl2-n11 


454 


35811 at 


0.566 


RNF13 


3q25.1 


455 


39110 at 


0.566 


EIF4B i 


12a13 13 


456 


35331 at 


0.565 


CTNNAL1 


9q31 .2 


457 


39663 at 


0.565 


MAN2A1 


5a21-a22 


458 


41138_at 


0.565 


MIC2 


Xd22 32 


459 


31936_s_at 


0.565 


LKAP 


16D13 2 


460 


38470 i at 


0.565 


APPBP2 


17a21-n23 


461 


33690 at 


0.565 


cDNA DKFZp434A202 




462 


39846_at 


0.564 


CTSF 


11q13 


4oo 


39109 at 


0.564 


C20orf1 


20q11.2 


464 


34372 at 


0.564 


UREB1 


Xp11.2 


465 


32521 at 


0.563 


SFRP1 


8p12-p11.1 


466 


35936_g_at 


0.563 


CPT1B 


22q 13.33 


467 


40698 at 


0.563 


CLECSF2 


12p13-p12 


468 


324_f at 


0.562 


BTF3 


5q13.3 


1 469 I 


333_s_at 


0.562 


RBMS1 


2q24.2 


470 


41606 at 


0.562 


DRG1 


22q12.2 
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Probe Sets 


CC 


Gene Symbol 


Cvtoaenetic 
Location 


471 


40281_at 


0.562 


NEDD5 


2q37 


472 


32240_at 


0.562 


PSMD5 


9q33.3 


473 


33441 at 


0.561 


TCTA 


3p21 1 


474 


39170 at 


0.561 


cDNA DKF2p564J0323 




475 


35366_at 


0.56 


NID 


1q43 


! 476 


41271 at 


0.56 


SLC7A8 


14q11.2 


477 


1530_g_at 


0.56 


13CDNA73 


13q12.3 


478 


33278_at 


0.56 


SAH 


16p13.11 


479 


38754 at 


0.56 


P8 


16p11.2 


480 


37891 at 


0.559 


i cDNADKFZp586F1822 




481 


36727 at 


0.558 


! ? 




482 


32506 at 


0.558 


TBC1D1 


4p14 


483 


37908_at 


0.557 


GNG11 


7q31-q32 


484 


39117 at 


0.557 


PHF2 


9q22.31 


485 


34320 at 


0.557 


PTRF 


17q21.2 


486 


36791_g_at 


0.556 


TPM1 


15q22 1 


487 


539_at 


0.556 


RYK 


3q22 


488 


40825_at 


0.556 


MAPRE3 


2p23.3-p23.1 


489 


32169_at 


0.556 


FBX021 


12q24.21 


490 


38782_at 


0.555 


GTF2H1 


11p15.1-p14 


491 


1677 at 


0.555 


IGFBP5 


2q33-q36 


492 


33899 at 


0.555 


ALDH9A1 


1q23.1 


493 


40843 at 


0.555 


ICAP-1A 


2p25.2 1 


494 


32172_at 


0.554 


SHARP 


1p36.33-p36.11 


495 


35303 at 


0.554 


INSIG1 


7q36 


496 


34235 at 


0.554 


GPR116 


6p12.3 


497 


818 s at 


0.554 


ATRX 


Xq13.1-q21 1 


| 498 


33113_at 


0.554 


CITED2 


6q23.3 


499 


34287 at 


0.553 


C21orf80 


21q22.3 


500 


33418 at 


0.553 


cDNA DKFZp434A012 




501 


36790 at 


0.552 


TPM1 


15q22.1 


502 


4081 1_at 


0.552 


COASTER 


6p11.1 


503 


41739 s at 


0.552 


CALD1 


7q33 


504 


509_at 


0.551 


MADH4 


18q21.1 


505 


37598 at 


0.551 


RASSF2 


20pter-p12.1 




oooz y at 


0.55 


DSIPI 


Xq22.3 | 


507 


41462 at 


0.55 


SNX2 


5q23 


508 


36032_at 


0.55 


? 




509 


39045 at 


0.549 


FU21432 


12p13.31 


510 


36577 at 


0.549 


MIG2 


14q22.1 


511 


39557_at 


0.549 


cDNAFLJ31246fis ! 




512 


3381 9_at 


0.549 


LDHB 


12p12.2-p12.1 i 


513 


38610 s at 


0.549 


KRT10 


17q21-q23 
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Probe Sets 


CC 


Gene Symbol 


Location 


514 


890_at 


0.548 


| UBE2A 


Xq24-q25 


515 


32730 at 


0.548 


KIAA1750 


8q22.1 


516 


1252_at 


0.548 


DP1 


5q22-a23 


517 


32239_at 


0.548 


MATN2 


8q22 


518 


33405_at 


0.548 


CAP2 


6p22.2 


519 


37266 at 


0.548 


ZNF32 


10a22-a25 


520 


39686 _jg_at 


0.548 


E46L 


I 22q13 31 


i 521 


40155_at 


0.547 


ABLIM1 


i 10q25 


522 


35988_i_at 


0.547 


MYST1 


16p11.1 


523 


34314 at 


0.547 


f RRM1 




524 


35213 at 


0.546 


I WBP4 


13a13 3 


525 


37676_at 


! 0.546 


PDE8A 


1 15a25 1 


526 


39545 at 


0.546 


L CDKN1C 


11d15 *5 


527 


37708 r at 


0.545 


ADH5 


4a21-a25 


528 


41686 s at 


0.545 


KIAA0752 


5a35 3 


529 


202_at 


05 45 


HSF2 


6a22 33 


! 530 


33399 at 


0.544 


? 




531 


39380_at 


0.544 


GTAR 


4a13 3 


532 


35166_at 


0.544 


DSCR3 


~ 1 J,vJ 

21a22 2 


533 


39693_at 


0.544 


MGC5508 


11a13 1 


534 


149_at 


0.544 


, DDX39 


19d13 1^ 


i 535 


40522_at 


0.544 


GLUL 




536 


40831 at 


0.544 


DKFZP586B0923 


10a22 2 


537 


32253 at 


0.544 


RERE 


1d36 1-d^6 9 


538 


1836 at 


0.543 


CCNI 


4n1^ 


539 


36991 at 


0.543 


SFRS4 


1d35 2 


540 


171 at 


0.543 


VBP1 


Xq28 


541 


38508 s at 


0.542 


CREBL1 


op21 .3 


542 


33856 at 


0.542 


CXX1 


Xq26 ' 


543 


36118_at 


0.542 


NCOA1 


2p23 


544 


32038 s at 


0.542 


SRP46 


11q22 


545 


36964 at 


0.542 


MBTPS1 


16q24 


546 


37005 at 


0.541 


NBL1 


lpoo.3-p36.2 


547 


34259_at 


0.541 


KIAA0664 


17p13.3 


548 


1725_s_at 


0.541 j 


Oncogene E6-Ap, 
Papillomavirus 




549 


34344 at 


0.541 


IKBKAP 


9q34 


550 


33303 at 


0.54 


SSPN 


12p11.2 


551 


32215 i at 


0.54 


RHOBTB3 


5q21.2 


552 


34675 at 


0.54 


cDNAFLJ 13555 fis 




553 | 


718_at 


0.54 


PRSS1 1 


10q26.3 


554 


35168_f at 


0.54 


COL16A1 


1p35-p34 


555 


33875 at 


0.539 


ATP6V0E 


5q35.2 
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557 



558 
559 



560 



561 
562 



563 



565 



567 



568 



569 



572 



573 



574 



575 



576 



577 



578 



579 



586 



587 



588 
589 



590 



593 



594 



598 



Probe Sets 



32548 at 



38980 at 



40998_at 
31907 at 



41770 at 



31605_at 
34684 at 



41872 at 



34853 at 



40467 at 



39405_at 



36925at 



32564 at 



33431 at 



37248_at 
39931 at 



35753at 



41713 at 



32171_at 



1675_at 



35644 at 



32569 at 



34370_at 



3801 1_at 



41194_at 
39509 at 



32143_at 
40634_at 



34255_at 
1101 at 



35999__r_at 



40083 at 



663_at 
39884a at 



1467 at 



31866_at 
1512_at 



39897 at 



38385at 



32170_g_at 
1850_at 
39366_at 



2062_at 



CC 



0.539 



0.539 



0.539 
0.539 



0.539 



0.539 
0.538 



0.538 



0.538 



0.537 



0.537 



0.537 



0.537 



0.536 



0.536 
0.535 



0.535 



0.535 



0.535 



0.535 



0.535 



0.534 



0.533 



0.533 



0.533 
0.533 



0.532 
0.532 



0.531 
0.531 



0.531 



0.531 



0.531 
0.531 



0.531 



0.529 
0.529 



0.529 



0.529 



0.529 
0.529 
0.529 



0.528 



Gene Symbol 



TEBP 



MAP3K7IP2 



TNRC11 
RPL14 



MAOA 



L 0C1 7122 0 
RECQL 



DFNA5 



FLRT2 



SDHD 



KIAA0266 



HSPA2 



SEC61B 



FMOD 



CPZ 
DYRK3 



PRPF8 



2NF36 



EIF5 



RASA1 



HEPH 



PAFAH1B1 



ARCN1 



C19orf2 



SRP14 
PDCD4 



OSR2 
NAP1L1 



DGAT1 
APBB1 



KIAA0781 



KIAA0625 



EIF1A 
HSA9761 



EPS8 



PD2 
DYRK1A 
KIAA1966 



DSTN 



FBXQ21 

MLH1 
PPP1R3C 



IGFBP7 



Cytogenetic 
Location 



12 



6q25.1-q25.3 



Xq13 
3p22-p21.2 



Xp11.4-p11.3 



12p13 



12p12 



7p15 



14q24-q32 



11q23 



13q12.2-q13.3 



14q24.1 



9q22.32-q31.3 



1q32 



4p16.1 
1q32 



17p13.3 



7q21.3-q22.1 



14q32.33 



5q13.3 



Xq1 1-ql2 



17p13.3 



11q23.3 



19q12 



15q22 
10q24 



8q22.1 
12q14.1 



8qter 
11p15 



11q23.2 



9q34.3 



_X 

5q11-q14 



12q23-q24 



19q13.1 
21q22.13 



4q13.1 



20p11.23 



12q24.21 
3p21.3 



1Qq23-q24 
4q12 
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| Probe Sets 


CC 


Gene Symbol 


Location 


599 


32765_f_at 


0.528 


PGCP 


8q22.2 


600 


38035_at 


0.528 


MTMR6 


13q12 


601 


37352 at 


0.528 


SP100 


2q36.1 


602 


36169 at 


0.528 


NDUFA1 


Xq24 


603 


37707_i_at 


0.528 


ADH5 


4q21-q25 


604 


41743 i at 


0.528 


OPTN 


10d12 33 


605 


34890_at 


0.527 


ATP6V1A1 


3a13 31 


606 


38351_at 


0.527 


cDNA DKFZp586L0120 




607 


38990 at 


0.526 


ICK 


6p12 3-D11 2 


608 


37389_at 


0.526 


L_ SMAP 


11p15.1 


609 


34445 at 


0.526 


KIAA0471 


1a24-a25 


610 


40859_at 


0.526 


L FLJ11806 


I4a31 3 i 


611 


| 37029 at 


0.526 


ATP50 


21q22.11 


612 


41490 at 


0.525 


L PRPS2 


Xd22 3-d22 2 


613 


39687 at 


0.525 


E46L 


22a 13 31 


! 614 


35247_at 


0.525 


SNAPC5 




615 


3441 7_at 


0.525 


cDNA DKFZp586E1 120 




616 


40105_at 


0.524 


L MUT 


6d21 


617 


41379__at 


0.524 


SMC5 


9a21 12 


618 


38059_g_at 


0.524 


DPT 


1a12-a23 


619 


39517 at 


0.524 


HTGN29 


5q31.1 


620 


38743 f at 


0.524 


RAF1 


3d25 


621 


41746 at 


0.523 


NHP2L1 


22a13 2-a1? ^1 


622 


507_s_at 


0.523 


ELF2 


"■ *" H *_| 1 O.O 1 

4a2ft 


623 


36423 at 


0.523 


P8 


16d11 2 


624 


40988 at 


0.522 


YME1L1 


10n14 


625 


34680 s at 


0.521 


KIAA0107 




626 


40962 s at 


0.521 


SMARCA2 


9p22.3 


627 


36792 at 


0.521 


TPM1 


15a22 1 


628 


40191_s at 


0.52 


KIAA0582 


2n14 


629 


37367 at 


0.52 


ATP6V1E1 


22a11 1 


630 


35221_at 


0.52 


PURA 




631 


38649_at 


0.52 


KIAA0970 


' V*VJ 1 *T. | | 


632 


34740 at 


0.52 


F0X03A 


6a21 


633 


41300 s at 


0.519 


ITM2B 


13q14.3 




fu^oa_g_at 


0.519 


MGC35048 


16p13.13 j 


635 


36829 at 


0.518 


PER1 ] 


17p13.1-17p12 


636 


869 at 


0.518 


GTF2A2 


15q21.2 


637 


3261 1_at 


0.518 


PBP 


12q24.22 


638 


37672_at J 


0.518 


USP7 ~f 


16p13.3 


639 


36533_at 


0.517 


PTGIS 


20q13.11-q13.13 


640 


40438 at 


0.517 


PPP1R12A 


12q15-q21 


641 


39118 at 


0.517 


DNAJA1 


9p13-p12 
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Probe Sets 


CC 


Gene Symbol 


^■jtji. rgji - tl — 

cytogenetic 
Location 


642 


39555 at 


0.517 


ING1L 


4q35 1 


643 


3851 8_at 


0.517 


SCML2 


Xd22 


644 


37027 at 


| 0.516 


AHNAK 


1 1a12-a1^ 

■ ■ h * *» H * o 


645 


40260_g_at 


0.516 


RBM9 


22a 13 1 1 


646 


147_at 


0.516 


TSG101 




647 


3761 6_at 


0.516 


AUH 


9n99 **** 


648 


39809 at 


I 0.516 


HBP1 


7a31 1 


649 


32119_at 


0.515 


cDNADKF2p586B211 




650 


35730 at 


0.515 


ADH1B 




651 


36095__at 


I 0.513 


CLIPR-59 


1 %7W| 1 O. I O 


652 


38654_at 


0.513 


HNRPU 


1 nd? 


653 


41768 at 


0.513 


PRKAR1A 




654 


38627_at 


0.512 


HLF 


iy n oo 


655 


487_g_at 


0.512 


CASP9 


1d36 ^-n^fi 1 


656 


33244 at 


0.512 


CHN2 


• \J I o.o 


657 


38724_at 


0.512 


KIAA0515 




658 


39740_g at 


0.512 


NACA 




659 


33850 at 


0.512 


L MAP4 




660 


35304_at 


0.511 


t RAB6A 


1 1 V| 1 o.o 


661 


! 36330_at 


0.511 


[ CCBL1 




662 


33240_at 


0.511 


SEMACAP3 


op | o 


! 663 


32745_at 


0.511 


MRPL40 




664 


274 at 


0.51 


ZNF148 


3q21 


665 




0.51 


AHCYL1 


1p12 


666 


34781 at 


0.509 


DCTN6 


ftnl 9-nl *f 


667 


38626 at 


0.509 


KIAA0399 


I'fJI o*o 


668 


38812 at 


0.509 


LAMB2 




669 


35736 at 


0.508 


GRINL1A 




670 


37359 at 


0.507 


KIAA0102 


I IC| IO.O 


671 


1873_at 


0.507 


XPC 


opzo 


672 
673 


35258_f_at 
40854_at 


0.507 
0.506 


SFRS2IP 




674 


40064 at 


0.506 


UQCRC2 
ALS2CR3 


16p12 
2q33 


67S 
\j i \j 




0.506 


MYH11 


16p13.13-p13.12 


676 


33444 at 


0.505 


M17S2 


17q21.1 


677 


38826_at 


0.505 


6-Sep 


Xq24 


! 678 


40832_s_at 


0.505 


LAP1B 


1q24.2 


679 


1195_s at 


0.505 


ICAP-1A 


2p25.2 


680 


35142 at 


0.504 


DKFZP564D172 


5q14.3 


681 
682 


38479 at 
37025_at 


0.504 
0.504 


ANP32B 
PIG7 


9q22.32 
16p13.3-p12 
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Probe Sets 


CC 


Gene Symbol 


Location 


683 


969_s_at 


0.503 


USP9X 


! Xp11.4 


684 


39739_at 


0.502 


NACA 


12q23-q24 1 


685 


! 41195_at 


0.502 


LPP 


3q27-q28 


686 


32576_at 


0.502 


t EIF3S5 


11 


687 


41242 at 


0.502 


UAP1 


1q23.1 


688 


35301 at 


0.502 


cDNA DKFZp564E2222 




689 


33380_at 


0.501 


POLS 


5p15 


690 


37221_at 


0.501 


PRKAR2B 


7q22-a31 1 1 


691 


3391 6_at 


0.501 


NISCH 


3d21 1 


692 


37895_at 


I 0.5 


SLC35A1 


6a16 1 


693 


1120 at 


0.5 


GSTM3 


1d13 3 


694 


31463 s at 


0.5 


? 




695 


36899 at 


0.499 


SATB1 


3p23 


696 


897 at 


0.497 


PKD1 


16d13 3 


697 


41174_at 


0.497 


RANBP2L1 


2a12 3 


698 


38106_at 


0.497 


YR-29 


5a13 3-a14 1 


699 


38673_s_at 


0.497 


CDKN1C 


11d15 5 


700 


41400_at 


0.497 


L TK1 


17a23 2-n?^ 3 


701 


41283 at 


0.496 


HNRPH3 


10a22 


702 


33835_at 


0.496 


KIAA0721 


6a22 31 


703 


34359 at 


0.496 


CGI-130 


6a 1 3-a24 3 


704 


38875 r at 


0.495 


GREB1 


2n25 1 I 


705 


40096_at 


0.495 


ATP5A1 


18a12-n?1 


706 


37529_at 


0.494 


CACNA1H 


16n13 3 


707 


3941 8_at 


0.493 


DKFZP564M182 


16n13 3 


708 


31897 at 


0.493 


DOC1 


3a12 3 


709 


37381_g at 


0.493 


GTF2B 


1o22-n^1 i 


710 


40280 at 


0.492 


B7 


12d13 


711 


40136 at 


0.492 


KIAA0676 


5q35.3 


712 


506 q at 


0.492 S 


STAT5A 


17q11,2 


713 


3581 2_at 


0.492 


TRN-SR 


7a31 1 


714 


41853 at 


0.491 


PRPSAP2 


17n1 1 9„n19 


715 


37022 at 


0.491 


PRELP 




716 


38079 at 


0.49 


GNG12 


1n31 9 


717 


36149_at 


0.489 


DPYSL3 


5q32 


718 


o i oou ai 


0.489 


D8S2298E 


8p12-p11.2 


719 
720 


37199 at 
37671_at 


0.488 
0.488 


CGI-60 


2p25.1-p24.1 


721 


31573_at 


0.488 


LAMA4 
RPS25 


6q21 
11q23.3 


722 


39696 at 


0.488 


PEG10 


7q21 


723 


39723 at 


0.487 


CUL1 


7q34-q35 


724 


1074 at 


0.486 


RAB1A 


2p14 


725 


32175 at 


0.486 


CDC10 


7p14.3-p14.1 
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Probe Sets 


CC 


Gene Symbol 


v»jf ivjy tsntf iiu 

Location 


726 


35364 at 


0.486 


APPBP1 


16q22 


727 


39019 at 


0.486 


LAPTM4A 


! 2p24.3 


728 


41772_at 


0.485 


MAOA 


Xp11.4-Dl1.3 


729 


31670_s_at 


0.485 


CAMK2G 


10q22 


730 


33426 at 


0.484 


CHGB 


20pter-p12 


731 


34393_r_at 


0.483 


RAB1A 


2p14 


732 


40903_at 


0.483 


ATP6IP2 


Xq21 


733 


41251 at 


0.483 


TRIP3 


I 17q21 1 


734 


39054_at 


0.482 


GSTM1 


1p13.3 


735 


3391 2_at 


0.482 


ZMPSTE24 


1p34 


j 736 


40709_at 


0.482 


ZNF271 


18q12 


737 


38662_at 


0.482 


BCRP1 


14q24 1 


738 


32755_at 


0.481 


ACTA2 


10q23.3 


739 


39741 at 


0.481 


HADHB 


2p23 


740 


35169 at 


0.481 


! COL16A1 


1 d35-d34 ^ 


741 


40555_at 


0.48 


TC10 


2p21 1 


742 


315_at 


0.48 


' PRDM2 


1p36 


743 


39180 at 


0.48 


FUS 


16p11.2 


744 


35253 at 


0.479 


GAB2 


11q13.4 


745 


31536 at 


0.479 


RTN4 


2p14-p13 


746 


767_at 


0.478 


MYH11 


16d13 13-d1*} 19 


\ 747 


39797_at 


0.477 


KIAA0349 


6p21.1 


748 


32854_at 


0.477 


FBXW1B 


5a35 1 


749 


41191_at 


0.477 


KIAA0992 


4a32 3 


750 


1151 at 


0.476 


RPL22 


1p36.3-p36.2 


751 


35294 at 


f\ at a 


SSA2 


1q31 


1 752 


1708 at 


0.475 


MAPK10 


4a22 1-a2^ 


753 
754 


39031 at 
37747_at 


0.475 
0.475 


COX7A1 
ANXA5 


I9a13 1 ^ 
4a28-a32 


755 
756 


38433 at 
38049_g_at 


0.475 
0.473 


AXL 
RBPMS 


19a13 1 
8d12-d11 


757 


36595 s at 


0.473 


GATM 


15a14 


758 
759 


39124 r at 
706_at | 


0.473 
0.473 


TRPC1 


3a22-n?4 


760 


37399 at 


0.47 


NR3C1 
AKR1C3 


5q31 
10p15-p14 


761 


oo^oo at 


0.47 


RBBP6 


16p12-p11.2 


762 


32563_at 


0.469 


ATP1B3 


3q22-q23 


763 


37734 at 


0.469 


DIP2 


21q22.3 


764 


39245 at 


0.469 


? 




765 


40618 at 


0.468 


H41 


3q22.2 


766 


31886_at 


0.467 


NT5E 


6q14-q21 


_767 


41807 at 


0.467 


cDNAFU31959fis 




768 


1447 at 


0.467 


PSMB1 


6q27 
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Probe Sets 

41227 at 



CC 



Gene Symbol 



Cytogenetic 
Location 



0.466 
0.465 



OCRL 



Xq25-q26.1 



770 



774_g_at 
35794 at 



MYH11 



16p13.13-p13.12 



772 



32254_at 
38694 at 



0.465 
0.464 



EFA6R 



FSTL3 



8pter-p23.3 



19p13 



774 



775 



776 



777 



34391 at 



39183 at 



31894 at 



39260 at 



0.464 



0.464 



0.463 



0.463 



0.462 
0.462 



KIAA0738 



IGBP1 



PCTK1 



CENPC1 



SLC16A4 



7q33 



Xq13.1-q13.3 



Xp11.3-p11.23 



4q12-q13.3 



1p13.1 



778 



779 



232 at 



1380 at 



0.461 
0.46 



LAMC1 



FGF7 



1q31 



15q15-q21.1 



41737_at 
36851_g_at 



SRRM1 



1p36.11 



781 
782 



783 
784 



41488 at 



39623 at 



0.46 



0.459 



0.457 
0.456 



N33 



LOC57149 



NDP 



8p22 



16p11.2 



Xp11.4 



785 



786 



787 



788 



36970 at 



40086 at 



33421 s at 



40047 at 



39420 at 



0.456 



0.455 



0.454 



0.452 
0.451 



KIAA0182 



KIAA0261 



SC5DL 



SBB103 



DDIT3 



16q24.1 



10q23.31-q23.32 



11q23.3 



12q12 



12q13.1-q13.2 



790 
791 



792 
793 



794 



795 



796 



797 



798 



799 



37951 at 



855 at 



35739 at 



33451 s at 



1278 at 



38542 at 



31896 at 



33341 at 



1846 at 



288 s at 



31672_g_at 
36120 at 



0.45 



0.45 



0.449 



0.448 



0.445 



0.44 



0.439 



0.439 



0.438 



0.438 



DLC1 



PDCD2 



MTMR3 



RPL22 



AXL 



NAG 



GNB1 



LGALS8 



LBR 



RBMS1P 



8p22-p21.3 



6q27 



22q12.2 



1p36.3-p36.2 
19q13.1 



2p24 



1p36.33 



1q42-q43 



1q42.1 



800 
801 



803 



804 



805 



806 



807 



808 



809 



810 
811 



38371_at 
31812 at 



35311 at 



41837 at 



39775 at 



37765 at 



39733__at 
38075 at 



41289 at 



38459_g at 
40461 at 



0.437 



0.437 
0.436 



0.436 
0.436 



0.436 



0.433 



0.431 



0.43 



0.43 



0.429 
0.427 



FVT1 



PSMA1 
GMPR 



CREG 



DKFZp761F2014 



SERPING1 



LMOD1 



HERPUD1 



SYPL 



NCAM1 



CYB5 



18q21.3 



11p15.1 
6p23 



1q24 



14q32.2 



11q12-q13.1 



1q32 



16q12.2-q13 



7q11.23 



11q23.1 



18q23 



TIX1 



20q12 
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Probe Sets 


CC 


Gene Symbol 


v^ywjyeneiic 
Location 


855 


1737 s at 


0.366 


IGFBP4 


17q12-a21 1 


856 


32847 at 


0.363 


MYLK 


3a21 


857 


37732_at 


| 0.363 


RYBP 


3p13 


858 


32184 at 


0.358 


j LM02 


1 1p13 


859 


41046 s at 


0.357 


2NF261 


Xq13.1 


860 


'< 40487 at 


0.356 


MC7 


11d11 2 1 


861 


32067 at 


0.353 


CREM 


10d12 1-d1 1 1 


862 


39561 at 


0.352 


DNAL4 


22a1 3 1 


863 


36569_at 


0.347 


TNA 


3d22-d21 ^ 


864 


39373 at 


0.345 


£ FADS1 


11a12 2-n1^ 1 


865 


38466 at 


0.344 


CTSK 


1a21 


866 


34784 at 


0.336 


DJ37E16.5 


! 22cen-a19 1 


867 


37842_at 


0.336 


HIC 


7n91 11 


868 


33255 at 


0.334 


NASP 


8a 11 O'K 


869 


1005_at 


0.329 


DUSP1 


5q34 


870 


41864 at 


0.329 


? 




871 


1241_at 


0.327 


PTP4A2 


1p35 


872 


i 38228 a at 




MITF 


3p14.1-p12.3 


873 


32340_s at 


0.322 


NSEP1 


1p34 


874 


38312_at 


0.316 


cDNA DKFZp5640222 




875 


3231 3_at 


0.305 


TPM2 


9d13 2-r>n 1 


876 


773_at 


0.302 


MYH11 


16d13 13-n1^ 19 


877 
878 
879 
880 


36065_at 
39066_at 
34826 at 
38430 at 


0.301 
0.29 
0.288 
0.286 


LDB2 
MFAP4 
SDHA 


4p16 
17p11.2 
5p15 


881 
882 
883 


31855_at 
33440 at 
40856 at 


0.274 
0.273 
0.258 


FABP4 
SRPX 
TCF8 
SERPINF1 


Rn91 
Xp21.1 
10p11.2 
17p13.1 


884 


40282_s at 


0.251 


DF 


19P13.3 


i 885 
886 
887 
888 


36165 at 
36201 at 
36521 at 
3693 1_at 


0.234 
0.234 
0.233 
0.225 


COX6C 
GL01 
DZIP1 

TAGLN 


6p21.3-p21.1 
13q32.1 
11q23.2 


889 
890 
891 
892 
893 


32314 g at 

33790_at 
38734 at 
39690 at 


0.222 | 

0.206 

0.203 

0.198 

0.182 


TPM2 
RANBP16 
CCL1 4 
PLN 
ALP 


9p13.2-p13.1 
8p21 
17q11.2 
6q22.1 
4q35 


894 
895 
896 " 
897 


31830 s at 
31831 at 
34203 at 
1197 at 


0.176 
0.157 
0.15 
0.134 


SMTN 
SMTN 
CNN1 
ACTG2 


22q12.2 | 
22q12.2 
19p13.2-p13.1 I 
2p13.1 
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Probe Sets 


CC 


Gene Symbol 


Cytogenetic 
Location 


898 


38995 at 


0.114 


CLDN5 


22q11.21 


899 


38994_at 


0.114 


SOCS2 


12q 


900 


36892 at 


0.042 


ITGA7 


12q13 
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TablelO. Ranking of the Top 100 Probe Sets Based on PCC Values 



Probe Set 
Name 


R1 


R2 




Probe set 
name 


R1 


R2 




Probe set 

nam a 


Kj 


R2 


37628 at 


0.865 


0.808 




! 37529_at 


0.669 


0.494 




201_s_at 


0.577 


0.571 


4iooy at 


0.865 


0.877 




32175 at 


0.669 


0.486 




774_g_at 


0.576 


0.465 


ooizu at 


0.852 


0.768 




35753 at 


0.667 


0.535 




40998 at 


0-576 


0.539 


OZ004 at 


0.848 


0.749 




38875 r at 


0.667 


0.495 




! 41772_at 


0.573 


0.485 


OC747 

oof if at 


S\ OAT 

0.847 


0.783 




32779 s at 


0.665 


0.599 




I 40522 at 


0.572 


0.544 


o4257 at 


0.846 


0.786 




41385 at 


0.665 


0.594 




41768 at 


0.571 


0.513 


o8220 at 


0.844 


0.820 




1319_at 


0.664 


0.580 




37828 at 


0.569 


0.677 


[ 40423_at 


0.844 


0.785 




32593 at 


0.664 


0.692 




280_g_at 


0.569 


0.377 


oooou at 


0.840 


0.748 




38101 at 


0.663 


0.615 




33431 at 


0.568 


0.536 


oo4oy at 
oyo/o i at 


0.829 
0.827 


0.736 
0.698 




39864 at 


0.663 


0.698 




1278 at 


0.566 


0.448 


OOU4 ( at 


0.822 


0.756 




39037 at 
32057_at 


0.663 
0.663 


0.726 
0.667 




35736 at 
37985 at 


0.566 
0.565 


0.508 
0.608 


loyo at 


0.821 


0.756 




38518 at 


0.663 


0.517 




38326 at 


0.565 


0.417 


37015 at 


0.820 


0.751 




39556 at 


0.662 


0.725 




37197 s at 


0.562 


0.647 


4U /y o at 


0.818 


0.666 




40841 at 


0.661 


0.618 




41529_g_at 


0.560 


0.620 


32145 at 


0.817 


0.786 




39117 at 


0.660 


0.557 




39838 at 


0.560 


0.582 


35742 at 


0.815 


0.708 




36695_at 


0.659 


o an? 




^u»2_s__at 


0.560 


0.679 


1290_g_at 


0.814 


0.776 




39681_at 


0.659 


0.690 




31672_g_at 


0.559 


0.438 


oyo74 r at 


0.811 


0.699 




1850_at 


0.659 


0.529 




37701_at 


0.559 


0.391 


ooyi7 at 


0.810 


0.752 




3801 3_at | 


0.657 


0.672 




40260 _g_at 


0.558 


0.516 


1897_at 


0.809 


0.770 




1377_at 


0.656 


0.605 




33249 at 


0.558 


0.618 


I 7oo_at 


0.809 


0.822 




41739 s at 


0.655 


0.552 




33198_at 


0.557 


0.705 


38176_at 


0.805 


0.729 




36095 at 


0.654 


0.513 




1708_at j 


0.557 


0.475 


OO073 at 
o^/ r b4 at 


0.804 
0.803 


0.774 
0.691 




35169 at 
36533_at j 


0.653 
0.653 


0.481 
0.517 




38116 at 
40832 s at 


0.556 
0.555 


0.595 
0.505 


39750 at 
35645 at 
38717 at 


0.796 
0.795 
0.793 


0.718 
0.714 
0.789 




40698 at 
33936 at 


0 651 
0.651 


0.702 




37958 at 
1787 at 


0.555 
0.555 


0.617 
0.374 


37394_at 


0.791 


0.656 




1507 s at 
38211 at 


0.648 
0.648 


0.596 
0.681 




333 s at 
36690_at 


0.554 
0.554 


0.562 
0.654 


36160 s at 


0.788 


0.846 




38351_at i 


0.648 


0.527 




40839 at 


0.553 


0.611 


36867 at 
39852_at 


0.785 
0.783 


0.739 
0.738 




32239 at 
32582 at 


0.647 
0.646 


0.548 
0.578 




32569 at 
33302_at 


0.553 
0.553 


0.534 
0.619 


37643 at 
40767_at 


0.779 
0.779 


0.716 
0.640 




31880 at 
38342 at 


0.646 
0.646 


0.489 
0.632 




1058 at 
38990 at 


0.552 
0.552 


0.668 
0.526 


41449 at 


0.778 


0.762 




33240 at 


0.645 


0.511 




506 s at 


0.551 


0.492 


40488_at 


0.778 


0.698 




33136_at 


0.645 


0.628 




871_s_at 


0.550 


0.630 
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Probe Set 


r\ 1 


Kg | 


41747 <5 at 


a 7nc 


O EOT I 

0.587 | 


3531 R at 


a *7AO 

o. suy 


/"l "Tin I 


3R50P* c of 
oojuo o dl 


A "7AO 
O. fUO 


0.542 | 


OOOH'f at 


0.707 


0.535 I 


vOOQD at 


0.707 


0.550 l 


37005 of 
of uuo ai 


0.706 


0.547 | 


40QR1 at 


O. fUo 


0.745 | 


3RQ4R of 

wOWO dl 


0.703 


0.756 I 


3Q74^ at 

o5?#*to at 


0.702 


0.775 I 


3Q3RQ at 


0,702 


0.635 I 


3R5QR r at 

OUvJOt) 1 dl 


A *7A** 


0.575_J 


3Q0*3P. ot 
oawoo ai 


0.700 


0.576J 


851 q of 


n /?nn 

c/.oyy 


0. 725 | 


41R55 ot 

" i uju dl 


u. oy 9 


0.664 I 


3594R ot 
oj£*fu at 


0.697 


0.652 J 


3ft**17 of 

woo i r ai 


0. 696 


0.593 | 


379Q4 ot 
O / ai 


0.696 


0.649 I 


351RP, f of 


0.696 


0.540 1 


3R15Q o of 

•JO 1 5357 5 ai 


0.696 


0.589J 


35759 e ot 


0.695 


0.642\ 


35395 at 
OgO^O ai 


0.694 


0.577 I 


71 P. of 

( io at 


0.693 


0.540 | 


3770ft r ot 
oti uo r at 


0.692 


0.545 J 


343R^ at 

ohodo at 


0.691 


0.665| 


POftfi e ot 

fcwu o at 


A £2f\>4 

0.691 


0.637 | 


3740 R of 
o r *tuo ai 


0.690 


0.665 J 


417QR at 


a /son 


0.779 | 


35331 at 

^wwj i dl 


A ADQ 


0.565 | 


173R at 

i / oo at 


ri COD 
0.688 


0.688| 


32107 at 


A COD 

O.Ooo 


0.598 [ 


34853 at ! 

VTUJO dl 


a ADD 


0.538 J 


3R3QR at 


a ADD 
O.OOO 


0.785 J 


32780 at 


0.687 


0 504 I 


32254 at 


0.687 


0.464 I 


37248 at 


0.687 


0.536 


33800_at 


0.687 


0.579 I 


38837_at 


0.686 


0.584 I 


39360 at 


0.685 


0.640 


41638 at 


0.683 


0.633 


34198^at | 


0.683 I . 


0.638 j 



Probe set 
name 


R1 


I R2 


39260 at 


0.606 


' 0.462 " 


! 39294 at 


0.606 


0.595 


2062 at 


0.605 


0.528 


31886_at 


0.605 


0.467 


38695 at 


0.605 


0.775 


767 at 


Q304 


0.478 


33878 at 


0.604 


0.657 


32119 at 


0.603 


0.515 


38228_g at 


0.602 


0.323 


39979 at 


0.602 


0.588 


35740 at 


0.601 


0.397 


32768 at 


0.601 


0.648 


34637_f at 


0.600 


I 0.678 


35794 at 


0.600 


j 0.465 


36792 at 


0.599 


0.521 


39366 at 


0.599 


0.529 


40770 f at 


0.599 


0.655 


38673 s at 


0.598 


0.497 


1675_at 


0.598 


' 0.535 


33399_at 


0598 


0.544 


32700_at 


0.598 


0.592 


32337 at 


0.597 


0.589 


41872 at 


0.597 


0.538 


1836 at 


0.597 I 


0.543 


176 at 


0.596 I 


0.654 


37598 at 


0.595 I 


0.551 


41490 at 
37266 at 


0.593 
0.593 


0.525 


39054 at 


0.592 I 


0.548 
0.482 


31936_s at 


0.59Y I 


0.565 


1629_s_at 


0.591 


0.692 


40461 at 


0.588 


0.427 


39031 at 


l/. woo 


0.475 


1529_at 


0.588 


0.586 


39555 at 


0.588 T 


0.517 


1005 at 


0.588 


0.329 


38745 at 


0.587 


0.702 


35784 at 


0.587 


0.622 


706 at 


0.587 


0.473 


41620_at 


0.587 [ 


0.618 



Probe set 
name 

35955 at 



38459_g at 



38466 at 



32708_g_at 



36521 at 



37315_f_at 



33440 at 



35843 at 



34235 at 



479 at 



R1 

0.496 



0.494 
0.493 



0.493 



0.493 



0.492 



0.492 



0.491 



0.491 



40140_at 
41379 at 



0.490 



39623 at 



41277 at 



36458 at 



35166 at 



654 at 



38826 at 



33915 at 



38626 at 



38074 at 



35730 at 



31932 f at 



0.490 
0.490 



0.489 



0.489 



0.488 



0.488 



0.488 



0.488 



0.429 



0.344 



0.627 



0.602 



0.273 



0.370 



0.666 
0.524 



0.457 



0.632 
0.372 



0.544 
0.643 



0.487 



0.487 



0.486 



0.505 



0.599 



0.486 



34803 at 



32066_g at 



37199 at 



33123 at 



36120 at 



0.485 



0.485 



0.484 



0.484 



0.483 



35294 at 



39856_at 
1323_at 
33830 at 



38743_f_at 
41656_at 
38106_at 
39420_at 
3911Q_at 
31463_s at 
41686_s_at 
41289 at 



0.483 



0.483 



0.482 



0.482 
0.481 



0.481 
0.480 
0.479 
0.479 
0.478 
0.478 
0.476 
0.476 



0.509 
0.634 



0.515 



0.586 



0.633 



0.407 



0.488 



0.605 



0.437 



0.476 



0.620 



0.604 
0.679 



0.524 
0.661 
0.497 
0.452 
0.566 
0.500 
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Probe Set 
Name 

38033 at 


R1 

0.683 


fi 1Q7 


Probe set 
name 


R1 


R2 


1467 at 


0.682 


U. wwV 

fi m 


226 at 
35739 at 


0.587 
0.586 


0.657 
0.450 


40203 at 

■ \r is V# Wife 


w. UUt 


fi Rftn 
u.oou 


37205 at 


0.586 


0.645 


38685 at 


o 68 fi 

w» UOl/ 


fi rag 
u.ooo 


2010 at 


0.585 


0.732 


35741 at 

WW f ~ 1 CI I 




fi A'fO 


37391 at 


0.585 


0.567 


39150 at 

WW 1 CI k 


w. UOU 


fi RPQ 

u.ooy 


38433 at 


0.585 


0.475 


37604 at 

W f WUT CI I 


\J.\jf & 


U.Of 3 


39165 at 


0.584 


0.710 


38812 at 

Www 1 CI L 


fi R7Q 

w. O/ C7 


f\ con 

u.ouy 


1677 at 


0.584 


0.555 


32696 at 

w«.www CI I 


y.ofo 


U.oU4 


1380 at 


0.583 


0.461 


38375 at 

WWW 1 \J CI L 


fi R78 
U.OfO 


U. fl3 


34842 at 


0.583 


0.759 


' 38254 at 


U.OfO 


0.601 


35767 at 


0.583 


0.662 


39089 at 

wJ/UOt eft 


U.Of / 


0.587 


36825 at 


0.583 


0.587 


32215 i at 

wC^. 1 w 1 ell 


U.Of f 


u.o4u 


38385 at 


0.583 


0.529 


39582 at 


fi R77 
U.Of f 


U.OJO 


34675 at 


0.582 


0.540 


36915 at 

WWW I w d I 


fi R7A 
U.OfO 


u.720 


1737_s_at 


0.581 


0.366 


40576 f at 

• w w f w | CH 


fi R7f> 
U.OfO 


U.ooZ 


38982 at 


0.580 


0.719 


539 at 


0.675 


0.556 


41971 at 

"ifci i at 


0.580 


0.560 


37595 at 


0.674 


0.618 


36595_s_at 


0.579 


0.473 


33868_at 


0 67? 

W.V/ w 


i/.oo/ 


! 38923_at 


0.578 


0.605 


37676 at 

w r w r w dL 


fi R70 
U.Ofd 


0.546 


37765 at 


0.578 


0.433 


^Q194 r at 

o*7 r or 


U.Of 3 


0.473 


37373_at 


0.578 


0.710 


ooo4y at 


0.673 


0.520 


37389 at 


0.578 


0.526 


41771_g_at 


0.673 


0.687 


38916 at 


0.578 


0.651 


227_g_at 


0.672 


0.733 


I 33899 at 


0.578 


0.555 


Probe Set 
Name 


R1 


R2 


Probe Set 
Name 


R1 


R2 


37486 f at 


0.459 


0.615 


1848 at 


0.288 


0.591 


38802 at 


0.458 


0.739 


34203_at 


0.288 


0.150 


32695_at 


0.458 


0.722 


40280 at 


0.286 


0 492 


35142_at 


0.457 


0.504 


1 37359 at 


0.286 


0.507 


149 at 


0.457 


0.544 


1 33790 at 


0.285 


0.203 


39561 at 


0.457 


0.352 


38542 at 


0.283 


0.445 


33835 at 


0.457 


0.496 


33870_at 


0.281 


0.589 


W^^L*+*T Ol 


0.455 


0.576 


869 at 


0.281 


0.518 


32313 at 


0.453 


0.305 


J 315_at 


0.277 


0.480 


37242 at 


0.452 


0.753 


36970 at 


0.275 


0.456 


38353 at 


0.451 


0.628 


38782 at 


0.275 


0.555 


32743 at 


0.451 


0.577 


39739 at 


0.275 


0.502 


36149_at 


0.449 


0.489 


41662 at 


0.275 


0.661 


38050_at 


0.449 


0.599 


37748_at 


0.274 


0.617 


39072_at 


0.448 


0.662 


33441 at 


0.272 


0.561 



Probe set 
name 



34860_g_at 



R1 

0.476 



R2 

0.593 



38980 at 



0.475 



0.539 



36636 at 



0.475 



0.590 



36423 at 



0.474 



0.523 



36065 at 



0.473 



0.301 



1873 at 



0.473 



0.507 



35055 at 



0.472 



0.586 



38694 at 



0.471 



0.464 



35276 at 



0.471 



0.575 



33799 at 



0.469 



773 at 



0.469 



0.607 



0.302 



40211 at 



0.467 



35303 at 



41283 at 



0.465 
0.464 



0.636 



0.554 



0.496 



37195 at 



40811 at 



0.463 



0.463 



1348 s at 



0.463 



0.411 
0.552 
0.718 



32038_s_at 



0.462 



0.542 



33856 at 



0.462 



0.542 



36543 at 



0.461 



0.648 



37022 at 



0.461 



38610_s_at 
33444_at 
32713 at 



0.459 
0.459 
0.459 



0.491 
0.549 
0.505 
0.598 
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Probe Set 
Name 


R1 


R2 


Probe Set 
Name 


R1 


R2 


32777_at 


0.447 


0.743 


35228 at 


0.272 


0.624 


38458_at 


0.447 


0.417 


32160 at 


0.269 


0.576 


37731 at 


0.446 


0.602 


41853 at 


0.268 


0.491 


38265_at 


0.446 


0.470 


41746 at 


0.268 


0.523 


41194 at 


0.445 


0.533 


32854 at 


0.268 


0.477 


I 40856 at 


0.445 


0.258 


1070 at 


0.267 


0.601 


36650 at 


0.444 


0.612 


35737 at 


0.266 


0.583 


39431 at 


0.443 


0.622 


^ 232_at 


0.266 


0.462 


39380_at 


0.443 


0.544 


34785 at 


0.259 


0.392 


37131 at 


0.443 


0.585 


34792 at 


0.259 


0.510 


32576 at 


0.442 


0.502 


36542 at 


0.257 


0.686 


36991 at 


0.441 


0.543 


192_at 


0.255 


0.643 


324 f at 


0.441 


0.562 


36330_at 


0.255 


0.511 


41837 at 


0.440 


0.436 


35364 at 


0.254 


0.486 


34753 at 


0.439 


0.587 


1846 at 


0.252 


0.439 


40083 at 


0.438 


0.531 


32172 at 


0.252 
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